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ABSTRACT 

We present a direct measurement of the metallicity distribution function for the high redshift inter- 
galactic medium. We determine the shape of this function using survival statistics, which incorporate 
both detections and non-detections of O vi and C iv absorption, associated with H I lines in high 
resolution quasar spectra. The O vi sample (taken from 7 QSOs at z a bs ~ 2.5) contains lines with 
Ah i > 10 136 , encompassing ~ 50% of all baryons at z ~ 2.5. Our survey accounts for * 98.8% of the 
C iv mass and « 90% of the O vi mass in the universe at this epoch. We find a median intergalactic 
abundance of [0,C/H] = —2.82; the differential abundance distribution is approximately lognormal 
with mean ([C,0/H]) « —2.85 and a = 0.75 dex. We discuss the sensitivity of these results to the 
assumed form of the ionizing UV radiation field. Some ~ 60 — 70% of lines in the Ly-a forest are 
enriched to observable levels of [O, C/H] > —3.5, while the remaining ~ 30% of the lines have even 
lower abundances. Thus we have not detected a universal metallicity floor as has been suggested for 
some Population III enrichment scenaria. In fact, we argue that the bulk of the intergalactic metals 
formed later than the first stars that are thought to have triggered reionization. We do not observe 
a strong trend of decreasing metallicity toward the lower density IGM, at least within regions that 
would be characterized as filaments in numerical simulations. However, an [O/H] enhancement may 
be present at somewhat high densities. We estimate that roughly half of all baryons at these redshifts 
have been enriched to [O, C/H] > —3.5. Using a simple "closed box" model for the metallicity evolu- 
tion of the IGM, we estimate the chemical yield of galaxies formed prior to z ~ 2.5, finding that the 
typical galaxy recycled 0.1 — 0.4% of its mass back into the IGM as heavy elements in the first 3 Gyr 
after the Big Bang. 

Subject headings: cosmology:miscellaneous - galaxies:formation - intergalactic medium 
quasars: absorption lines 



1. INTRODUCTION 

Observations of C iv absorption in QSO spectra 
have unambiguously revealed the presen ce of heavy el- 
ements in the Ly-a forest at z = 3 $M eyer_&L-Yoxk 
| 1987UCowie et alll995allEllison et alJ2000HCowie et al. 
1995b* IRauch et aU 11 997aL 119961 ISc have et al.l 12003V 
These results have been interpreted as evidence of 
widespread enrichment of the universe with the byprod- 
ucts of stellar nucleosynthesis, but the data at present 
permit the possibility of several enrichment mechanisms, 
from an early [z > 10) generation of m assive short- 
lived stars (e.g.. lOstriker fe Gnedml Il996(l to more re- 
cent (z < 7) pollution by winds from star form- 
ing galaxies iAdelberger et alJl200.1 lAguirre 7TaTll2"00Tl 
iSpringel fc HernauistJ l2003). Measurements of the IGM 
metallicity can usefully constrain the history of star for- 
mation and stellar recycling on cosmological scales. How- 
ever, at a limiting associated H I column of ~ 10 145 

1 The observations were made at the W.M. Keck Observatory 
which is operated as a scientific partnership between the Califor- 
nia Institute of Technology and the University of California; it 
was made possible by the generous support of the W.M. Keck 
Foundation. 

2 Palomar Observatory, California Institute of Tech- 
nology, Pasadena, CA 91125, USA; ras@astro.caltcch.edu, 
wws@astro.caltech.edu 

3 Carnegie Observatories, 813 Santa Barbara Street, Pasadena, 
CA 91101, USA; mr@ociw.edu 

4 Current Address: Pappalardo Fellow, MIT Center for Space 
Research, 77 Massachusetts Ave. #37-664B, Cambridge, MA 
02139, USA; simcoe@mit.edu 



cm~ 2 , existing C iv measurements only probe regions 
with gas overdensities of p/p > 7 relative to the cosmic 
mean. This corresponds to the densest ~ 15% of the 
baryons in the Ly-a forest, which occupy < 1% of the 
total volume of the universe. 

At H I column densities below 10 145 cm~ 2 , 
C iv searches are limited by signal-to-noise ratio even 
in the best available spectra. A decline in the over- 
all carbon column in low density gas is exacerbated 
by the increasing ionization state of the gas, which fa- 
vors C v and higher levels in the more tenuous re- 
gions of the IGM. To combat this effect, several stud- 
ies have employed sensitive statistical techniques to ex- 
tract averaged estimates of [C/H] at lower densities. 
These studies generally fall into two categories: cither 
a shift-and-stack approach to g enerate a high signal- 
to-noise, composite C iv line l)Lu et alJ 1199 8V or a 
pixel-by-pixel statisical analysis of the relative optical 
depths of H I and C IV throu ghout the run of a spec- 
trum llCowie fe Songailal 119981: ISqngaila fc Cowiel 119961 
lEllison et all 120001: lAguirre et all 12002(1 . The reader is 
referred to lEllison et alJ l|2000(l for a review of these two 
methods. Over time, pixel methods have seen more ex- 
tensive use, although they can be difficult to interpret 
without recourse to simulations, and in our experience 
they have been somewhat sensitiv e to slight systemat- 
ics (e.g. continuum pla cement). ISchave et al.1 l|2003fl 
and lAguirre et aD l|2003fl have recently compiled com- 
prehensive studies of pixel statistics which address many 
of these issues, using numerical simulations to trace the 
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distribution of intergalactic carbon and silicon with den- 
sity and redshift. 

In this work, we describe a new effort to pursue metal- 
licity measurements in the IGM to near the cosmic mean 
density through observations of weak vi absorption in 
a sample of 7 QSOs. This is supported by new observa- 
tions of C iv absorption in high signal to noise spectra 
of two sample objects. We have limited our measure- 
ments to a single observed quantity: the column den- 
sity of detected O vi and C iv lines (or 3er upper limit 
on No vi, Nc xv f° r non-detections). This paper there- 
fore represents our best effort to describe the enrichment 
of the IGM using the direct, local properties of individ- 
ual systems. This approach accepts a slight sacrifice in 
sensitivity over the statistical methods quoted above, in 
return for a localized description of the metallicity field 
and a straightforward assessment of possible systematic 
biases. It also allows for an easy division of the total 
sample into subsets to study trends of metallicity with 
other external variants, such as H I column density, or 
proximity to galaxies (though the latter is beyond the 
scope of this paper). 

The literature contains several examples of intergalac- 
tic O vi searches, as it has long been known that its 
ionization potential and high abundance ar e very favor- 
able for production in the low density IGM ijNorris et al.l 
IT9S^ lOha.ffee et ahllTflS^ 5 . The principal hurdle faced 
by these searches is the unfortunate rest wavelength of 
the O VI doublet (1032,10371). This locates the weak 
O vi lines deep in the Ly-a forest, which smothers the 
signal of many real systems and further introduces a 
number of false positive identifications. Many of the 
O vi systems discovered in these surveys display large 
equivalent widths and rich chemical str ucture, as would 
be expected in galactic e nvironments ijBurles fc Tvtlerl 
119961 ISimcoe et_alj j2002j). Usin g a single spectrum of 
Q1422+2309. IDave et all l|1998j) statistically explored 
the O VI content of the low density IGM at z ~ 3.5, 
finding evidence for d eclining abundances in lower den- 
sity regions. However. ISchave et alJ l|2000j) have claimed 
a detection of O vi even at very low den sities, reaching 
below the cosmic mean. More r ecently iCarswell et alJ 
( 2002) and iBergeron et alJ (|2002fl have discovered evi- 
dence for O vi enrichment in a number of individual sys- 
tems with iV H i ~ 10 14 - 5 at z ~ 2.3. 

This paper expands upon these studies using an in- 
creased number of sightlines observed at high resolution, 
and also by incorporating the full use of non-detections 
in the analysis. These additional measurements have al- 
lowed us to use the methods of survival analysis to con- 
struct the distribution function of [O/H] and [C/H] in 
the Ly-a forest. We shall demonstrate that our measured 

5 N.B.: Searches for weak metals in the high redshift Ly- 
a forest are qualitatively different from the recent low redshift 
O VI surveys undertaken with FUSE and STIS ITripp &: Savage) 
l20MlTrTp"p et al . 2001; Savage et al. 2002). The local, "warm-hot" 
variety of O VI is usually collisionally ionized through interactions 
with galaxies, or by accretion shocks as pre-enriched material falls 
onto large scale structure. The weaker high redshift population 
should be predominantly cool er gas that is photoionized by the 
integrated light from quasars (Rauch ct al. 1997b; Carswcll et al. 
120021) . although we reported in 15rmTOe^t^d^ " l2002h on a separate, 
co-existing population high redshift systems that may be collision- 
ally ionized. Photoionized O VI syst ems are physically distinct from 
the "warm- hot" phase of the IGM IDave et al1l200ll) . 



Table 1. Object Summary 



Object z em Azov/ 1 

Q1009+2956 2.62 2.295-2.553 

Q1217+4957 2.70 2.374-2.635 

Q1347-2457 2.53 2.329-2.525 

Q1442+2931 2.63 2.270-2.556 2 

Q1549+1919 2.83 2.273-2.767 

Q1603+3820 2.51 2.201-2.415 3 

Q1700+6416 2.72 2.259-2.651 



1 Corrected to exclude regions within 5000 km/s of the QSO 
emission redshift. 

2 The region 2.41 < z < 2.46 was excluded for this object 
due to the presence of a weak Damped Ly-a system in this 
wavelength range. 

3 The region 2.415 < z < 2.450 was excluded for this object 
due to the presence of a strong system which appears to be 
ejected from the QSO. 

distribution function agrees quite well with very recent 
models produced f rom the completely different method 
of pixel statistics ijSchave et al.ll2003j) . In Section [5] we 
describe the observations, sample definition, and mea- 
surement methods; Section |31 explains the conversion of 
column density measurements to abundance estimates 
and the application of survival statistics to estimate the 
[O/H] distribution; Section 0] discusses the cosmological 
implications of the observed metallicity distribution. 

2. OBSERVATIONS 

Our search targets 7 bright QSOs in the redshift range 
2.5 < z < 2.8, which was chosen to balance three 
important factors regarding the existence and observ- 
ability of O vi : contamination from the Ly-a forest 
(which worsens toward higher redshift), strength of the 
metagalactic UV ionizing flux (which is maximized at 
2.5 < z < 3.0), and accessibility from large aperture, 
ground-based telescopes (to improve S/N for the weak- 
est lines). The sightlines are listed in Table 1, along 
with their QSO emission redshifts and the range of red- 
shift covered by the absorption line measurements. Four 
of the seven spectra w ere used in our earl ier analysis of 
strong O vi systems l|Simcoe et alJ 1200*21) . and the de- 
tails of those observations are found therein. New obser- 
vations of Q1217+4957, Q1347-2457, and Q1603+3820 
were taken in April 2002 under variable conditions; 
Q 1626+6433 was used in the previous paper but is omit- 
ted here because of the data's lower signal to noise ratio 
and small redshift coverage. The observations were taken 
with the HIRES spectrograph on the Keck I telescope, 
using the UV blazed cross disperser. All exposures were 
taken through an 0'.'86 slit fixed at the parallactic an- 
gle, for a spectral resolution of 6.6 km s _1 , and the data 
were reduced using T. Barlow's "makee" echelle reduc- 
tion package. 

2.1. Identification and Measurement of the 
O VI Systems 

Because O vi is placed within the Ly-a ,Ly-/3 , and 
higher order Lyman f orests, our first step has been to 
follow the procedure of lOarswell et alJ l|2002f ) . fitting the 
entire Ly-a forest region to remove high order H I tran- 
sitions from the data. Beginning at the emission redshift 
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of the quasar, each H I line in the forest was fit with a 
combination of Voigt profiles using the VPFIT 6 software 
package. This procedure was extended to lower redshift 
until the observed wavelength of Ly-a was equal to the 
observed wavelength of O vi at the emission redshift of 
the QSO. For the handful of cases in each spectrum with 
Nn z > 10 15 (where H I lies on the flat part of the curve of 
growth) we performed joint fits for the H I column den- 
sity using Ly-a ,Ly-/3 ,Ly-7 , and Ly-S . In all other cases, 
only Ly-a was used, as the primary goal was to remove 
these higher order lines. As a side benefit, we obtain 
a full statistical description of the forest in the redshift 
range of the O VI survey. This information is used in Sec- 
tion 3.7 to estimate the mass fraction of baryons probed 
in the sample. 

Once a satisfactory fit was obtained for the forest, we 
adjusted the original data and error arrays, using the 
model fit to remove the signal of Ly-/3 and higher order 
Lyman series transitions. Even small fluctuations around 
the continuum level were often caused by Ly-7 to Ly-7 
transitions from higher redshift systems. By removing 
this signal the useful O vi pathlength in each spectrum 
is nearly doubled, correspondingly doubling the size of 
our line sample. 

vi Sample Selection and Measurements 

For the O VI measurements, we have focused on the 
subset of systems with H 1 column densities above Ah i = 
10 13 ' 6 cm~ 2 . This represents almost an order of magni- 
tude increase in H I depth over past C iv and O vi sur- 
veys, and corresponds to clouds with overdensities of 
p/p ~ 1.6 relative to the cosmic mean at z ~ 2.5. The 
quality of the O vi data varies throughout the survey 
according to sightline and redshift, and not all regions 
are sufficiently sensitive to reveal O vi lines at the level 
expected for the weakest H 1 systems in the sample. How- 
ever, in some regions the data quality is very high, and 
we shall describe below how measurements from the best 
regions can be combined consistently with upper limits 
measured in portions of the spectrum with lower signal- 
to-noise ratios. 

We selected candidate systems using H 1 column densi- 
ties obtained from our Voigt profile fits to the Ly-a forest. 
For each line in the Ah : > 10 136 sample, we examined 
the spectrum (now free of higher order H 1 lines) for 
O vi at each sample redshift. In instances where a possi- 
ble O vi line was detected at > 3a significance, we used 
VPFIT to determine the column density and b parameter 
of the O Vi line. At this stage, systems were flagged as 
possible detections if both components of the O VI dou- 
blet were clearly visible, or if absorption was present in 
one component but the other was strongly blended with 
a Ly-a forest line. If no O vi absorption was present 
at the expected location of one or both doublet compo- 
nents, we determined 3er upper limits on the O vi absorb- 
ing column. The upper limits were measured from the 
O vi 1032A transition except in cases where Ly-a forest 
blending led to a cleaner result from O VI 1037A. 

When no O vi is detected, the measurement of an up- 
per limit on Ao vi depends upon the choice of linewidth, 
b, which cannot be determined from the data. Rather 

6 Provided by R. Carswell. J. Webb. A. Cooke, & M. Irwin - 
|htt p : / / www . ast .cam, ac.uk/~rfc/ vpfit . htrrd| 



Table 2. Numbers of O vi Detections vs. Non-Detections 



Class 


-^thermal 


^turb 


dd 


41 


41 


bd 


71 


79 


ltd 


118 


110 



than guessing at the correct value to use, we have mea- 
sured two upper limits for each system. For one mea- 
surement, we fix bo vi to the value for completely ther- 
mal line broadening in the H 1 -O VI gas mixture: 

bo vi = bn 1 x y/ ma i/too vi- The other limit corre- 
ponds to broadening from turbulent or bulk gas flows: 
60 vi = &Hi- The actual value should lie between these 
two extremes. We also fix zo vi = z Ri for measuring 
limits on non-detections; for detections we allow the red- 
shift to vary up to ~ 40 km s _1 to optimize the fit. Often 
velocity differences are observed between 2hi> 20 vi 5 and 
zq iv in strong systems. However, for our non-detections 
the upper limits are not strongly sensitive to the exact 
placement of Zovd as the noise properties of the data 
tend to be fairly uniform over the ~ 20 km s -1 velocity 
offsets sometimes observed between different ions. All 
quoted upper limits are at the 3u level when no line could 
be fit. In cases where VPFIT was able to fit a line with 
N vp at < 3cr significance, we quote an upper limit of 
Ao vi < N vp + 3cr. Regions where VPFIT found absorp- 
tion at the 3<7 level or greater are reported as detections 
with the best fit column density. 

In most of our systems, one component of the O vi dou- 
blet is blended with a Ly-a forest line. If the other 
transition is clean of blends and shows no O vi its in- 
terpretation as a non-detection is secure. This situta- 
tion is fairly common, and we denote it as a bn system, 
for "blend/non-detection". But often one transition is 
blended with a forest line while the other shows weak 
absorption that could be either O vi or another inter- 
loping Ly-a line. These systems, hereafter called the bd 
sample ("blend-detection"), comprise a potentially seri- 
ous source of false positive O VI identifications. A conser- 
vative approach is to treat all bds as upper limits due to 
their unsure identification. However, if we assign upper 
limits to all bns and bds we would downweight many true 
detections and bias our results toward low metallicities. 
Yet if we were to restrict our sample to those systems 
where both components of the doublet are completely 
free of blending (accordingly, the dd detections and nn 
nondetections) the sample would be too small to provide 
physically interesting constraints on the metallicity of the 
IGM. At the measurement stage, our approach has been 
to treat all bn and nn systems as upper limits, and all bd 
and dd systems as detections for completeness. Later in 
the analysis, we use Monte Carlo simulations to estimate 
the degree of contamination from false positives in the 
bd subsample, and correct for their effects statistically. 

2.2. Identification and Measurement of the 
C IV Systems 

For two of the objects in the sample, we have obtained 
particularly high quality spectra to search for extremely 
weak C IV absorption features. Our intent is to com- 
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Fig. 1. — Scatter plot of detections and upper limits in the 
-?Vh i/No vi plane. Solid round points with errors represent the dd 
variety of O VI detections, and small squares represent bd detec- 
tions. Arrows indicate 3cr upper limits for undetected, thermally 
broadened O VI . The solid line drawn through the data represents 
the expected locus of points for a mean [O/H] = —2.5 and our 
fiducial UV background spectrum. 



pare the distribution of [C/H] with that of [0/H], both 
to check for consistency and to test the accuracy of the 
ionization corrections that we will use to estimate metal- 
licities. By far the best spectrum in the sample is that 
of Q1549+1919, with an average signal to noise ratio of 
~ 350 per resolution element throughout the C IV region. 
We also analyze the C iv absorption in Q1700+6416, 
whose spectrum has S/N ~ 175 per resolution element. 
While the second spectrum is not as sensitive for mea- 
suring the weakest C iv lines, we have included it to 
increase the C iv pathlength and improve the statistics 
of stronger absorbers. 

As before, we use Voigt profile fits to the Ly-a for- 
est to determine the subset of lines to be examined for 
C IV absorption. However, even with such high qual- 
ity data the predicted C iv strength drops so precipi- 
tously with TVh i that we cannot probe overdensities as 
low as with O VI . Therefore, we limit the C iv sample 
to systems with H I column densities above Nh z = 10 140 
cm -2 . Because the same objects were observed for O VI , 
we were able to fit across the Lyman series for saturated 
H I lines so the H I columns are very accurately known. 
At each sample redshift, we examine the location of the 
C iv 1548. 202A line and measure its strength if a de- 
tection is made. If no feature is present, we determine 
a 3cr upper limit on the C iv column density for both 
the thermal and turbulent broadening cases. For nearly 
all of the systems, we used the 1548. 202A component to 
measure upper limits. In the unusual case where this line 
was blended with a line from another redshift system, we 
used the 1550. 774A component to measure the limits. 

2.3. Distribution in the Hi /O VI and Hi /C IV Planes 

We have measured O vi and H I column densities for a 
total of 230 Ly-a forest lines with N H : > 10 13 6 cm" 2 in 
7 sightlines, and C IV and H I column densities for a total 
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Fig. 2. — Scatter plot of C IV and H I data. Data from only two 
sightlines (Q1549+1919 and Q1700+6416) are used. Model curve 
is as in Figure HI 



of 83 systems with JVh i > 10 140 cm -2 in two sightlines. 
These data are summarized in Figures ^ and |2 where we 
plot measurements or 3cr upper limits for each O VI and 
C IV line against its H I column density. The upper limits 
shown in the plots are for the thermal line broadening 
case. 

In the O VI figure, we have subdivided the detected sys- 
tems according to our confidence in their identification. 
Heavy round points denote highly probable O vi , where 
both components of the doublet are seen in the proper 
ratio of strengths (the dd sample). Smaller squares are 
used for the bd sample, where only one of the two doublet 
components could be measured due to Ly-a forest blend- 
ing with the other line. Again, we stress that this sub- 
sample contains spurious measurements from interloping 
H I lines - in Section 3.5 we estimate that only ~ 40% 
of these points are actually O vi . In false positice de- 
tections the measurements still provide upper limits on 
the O vi column density, so in truth some ~ 60% of the 
square points would be replaced with upper limit sym- 
bols. The C iv sample does not suffer from this sort of 
contamination, and all detections are weighted equally. 

The solid curve in each panel indicates the locus 
of points predicted for Ly-a forest clouds enriched to 
[0/H] = [C/H]=-2.5 7 and illuminated by a slightly 
modified version of the lHaardt fc Madaul l)1996|) ionizing 
background spectrum (see Sections 3.1 and 3.2 for a de- 
scription of how these model predictions are generated). 
Our observations have reached the sensitivity required to 
detect metal lines at this commonly quoted abundance 
level for much of the sample. Even at iVm < 10 140 , 
some of our O vi measurements are able to meaningfully 
probe the metal distribution, although the presence of 
several upper limits above the model curve indicates the 
increasing challenge at the lowest N-r j . 

Generally, the O vi and C IV data points follow the 

7 We use the standard notation where [X/H] represents the log 
of the abundance for element X relative to the solar level, i.e. 
[X/H] = log(X/H) - log(X/H) Q . 
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trends traced by the model predictions, with a few excep- 
tions. At high column densities (about Ah i > 2 x 10 15 ) 
most H I systems appear to contain O vi , but the O vi is 
often significantly stronger than predicted. This discrep- 
ancy is not surprising, and could be due to several fac- 
tors. First, the recipe for producing the model curve as- 
sumes a scaling between physical gas density and H I col- 
umn which may break down at higher densities where 
cosmic structure becomes nonlinear. More importantly, 
the models calculate metal strengths assuming a pure 
photoionization equilibrium, whereas the strongest sys- 
tems may be collisionally dominated. Finally, the dense 
systems are likely sites of local chemical enrichment so 
their mctallicity could be higher by an order of magni- 
tude or more. 

At lower Nn 1 there are several systems with exception- 
ally low metal to H I ratios, both for C IV and O VI . In 
some cases these lie almost an order of magnitude be- 
low the model prediction, even though the model should 
be most accurate in this regime. The presence of sev- 
eral such systems was initially a puzzle as we expected 
to find O VI and C iv absorption throughout the IGM. It 
is not obvious from Figures and [21 whether these out- 
lying systems are consistent with simple scatter about a 
trend, or whether they represent a different population 
of lines that is statistically distinct from the rest of the 
sample. This question will be addressed in detail below 
as we construct a quantitative distribution function of 
the intergalactic metallicity field. 

3. ANALYSIS 

Having fit O vi and C iv line strengths for the H I se- 
lected systems, we now translate these measurements 
into metallicity estimates for each line in the sample. 
In the following sections we describe the methods used 
to build the metallicity distribution using survival statis- 
tics, including corrections for false positive O vi identifi- 
cations. 

3.1. Estimating [O/H], [C/H] for Ly-a Forest Lines 

Using measurements of O vi and H I , the oxygen abun- 
dance for a single system is calculated as: 
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where fo vi = no vi/no and /h i = nu i/Vih are ioniza- 
tion fractions and we have assumed solar relative ele- 
mental abundances 8 . Clearly the same method may be 
applied to calculate [C/H] from our C iv line sample. 
Also, the equation may be inverted to predict No VI for 
different values of Ah i if [O /H] is already known (this is 
the method used to produce the model curves in Figures 
Q] arid El . 

Throughout, we us e the m eteoritic solar abundances 
of iGrevesse k, Sauvall l)1998j) . with A oxygcn = 8.83 and 
^carbon = 8.52 on a scale where ^.hydrogen = 12. Several 
revisions to these Solar abundanc es have been sug- 
gested since the pub lication of I Grevess e fc Sauvall 
(11998ft. including lAllende Prieto et alJ l|2002j) . 
lAllende Prieto et alJ (|200lD . lAsnlundl l|2003T) . and 

8 In Section 13.61 we discuss the implications of other choices for 
[C/O]. 
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Fig. 3. — Three representative models of the UV/X-Ray back- 
ground used in the analysis. The solid line represents our favored 
HM1.80 model. The ionization edges for several important transi- 
tions are labeled, with the convention that the "O VI " label marks 
the wavelength associated with the ionization of O v t o O VI . The 
point with errorbars represents the Scott et al. (2000) estimate of 
log J912 from the proximity effect. 



IHolweee'rl l|2001|) . Allende Prieto advocates a downward 
revision of both the carbon and the oxygen solar 
abundances by 0.13 — 0.14 dcx. Likewise, Asplund 
reduces A oxygen by 0.17 dex and A car t> n by 0.11 dex. 
Holweger uses a standard oxygen abundance that is 
0.09 dex lower, but a carbon abunda nce that is 0.07 dex 
higher . We have continued to use the|G revesse fc Sauval 
( 1998) abundances for ease of comparison with prior 
studies of intergalactic metal enrichment. Readers who 
prefer the Allende Prieto or Asplund values should 
increase all [C/H] and [O/H] estimates in this paper by 
« 0.13 — 0.14 dex. Renormalization of our measurements 
to Holweger's scale is slightly less straightforward. It 
requires a similar additive adjustment to the solar zero 
point, but also a hardening of the UV background field 
to match the relative distributions of carbon and oxygen 
(see section l3~B|) . The net effect is a downward shift in 
all of the metallicities listed throughout the paper, by 
w 0.07- 0.1 dex. 

3.2. Modeling Intergalactic Ionization Conditions 

The principal challenge of our metallicity calculation 
is an accurate estimation the ionization correction term 
in Equation^ We have run simulations of the ionization 
conditions in the IGM to calculate this term as a function 
of H I column density for each line in the sample. 

In the low density Ly-a forest, simulations suggest that 
there exists a tight correlation betwee n physical gas den- 
sity and observed H I column d ensity ijZhang et alJl!995t 
iDave et al.lll999tlSchavell200H) . We can use this relation 
to translate each H I column density in t he sample into a 
physical density, inverting Equation 8 of Schava 1)200 1|) : 
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The last three factors represent scaling corrections for 
the gas temperature, ionizing background, and fractional 
mass in diffuse gas, and are likely to be of order unity 
as argued by Schaye. This relation was derived assum- 
ing that the local collapse of Ly-a forest clouds oc- 
curs nearly in hydrostatic equilibrium. It may break 
down at high densities where the gas becomes optically 
thick (-ZVhi > 10 17 ), or at p/p < 1 where gravitational 
timescales become comparable to the Hubble time. It has 
not been verified observationally over most of the Nn i 
range considered here, since Ly-a forest systems typi- 
cally exhibit few or no other lines which could be used to 
construct an estimate of «h. However, numerical inves- 
tigations indicate that the scaling remains ac curate from 
rough ly the mean density to iVn i < 10 16 ijDave et al.l 
1999). 

The intergalactic gas is highly ionized by ambient 
ultraviolet/X-Ray radiation, whose spectrum at z ~ 2.5 
is dominated by quasar light that is reprocessed by ra- 
diative transfer through the IGM. We have run sets of 
ionization calculations for several different forms of the 
ionizing backg round spectrum, which were computed as 
described in [Haardt fc Madaul l)1996ft and kindly pro- 
vided to us by F. Haardt. A range of these spectra is 
shown in Figure |3 details about their construction are 
provided in Appendix A. The curves differ both in their 
normalization, and in their underlying UV spectral slope. 
The discrepancies reflect uncertainty in the exact shape 
of the source function in the radiative transfer equations 
- i.e., the average UV spectral slope intrinsic to QSOs. 
All of the models assume that quasar spectra in the UV 
are pure power laws: F v oc v~ a . Intergalactic H I and 
He II absorption and reemission modulate this raw spec- 
trum, producing the features seen in the 10— 1000A range 
of Figure We have named the models according to 
their input spectral slope, so that the HM1.80 model 
represents a spectrum with an input a — 1.8 power law 
that has been propagated through the IGM. The over- 
all normalizations of the spectra are specified at 912A (1 
Ry) as indicated in the plot. 

Recent attempts to measure a by averaging quasar 
spectra have produced va ried estimates in the r ange 
1.5 < a < 1.8. The original lHaardt k. Madaul (11996ft pa- 
per us ed an a = 1.5 model. Subsequently, Zhen g et al.l 
(1997) published a revised estimate of a = 1.8, based on 
FOS observations of 101 q uasars. The most r ecent deter- 
mination of the UV slope ijTelfer et alJl2002l 184 QSOs) 
revises the value again to a = 1.57, but raises the possi- 
bility of strong variations from object to object. Each of 
these papers used the entire sample of FOS quasar spec- 
tra in the HST archive at the time of its publication. The 
samples are therefore not uniformly chosen, and could be 
subject to selection effects. We have tested each of the 
different QSO models, and also a HM1.80 QSO spec- 
trum with additional H I ionizing flux from galaxies. For 
consistency below we adopt the HM1.80 quasar spectrum 
normalized to Jgi2 = —21.5 (solid line in Figurc[3J) as our 
fiducial X-Ray/UV background. We shall show that this 
choice results in the best match between the intergalac- 
tic distributions of [O/H] and [C/H]. It is also roughly 
consistent with observations of the proximity effect and 
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Fig. 4. — The ionization balances of several carbon and oxygen 
transitions relevant to C IV and O VI production, as a function of 
H I column density. 



X-Ray backgroun d at similar redshifts ijScott et al]l2000l 
IBovle et 3.1.111991 . 

Using these input spectra, we simulate the ionization 
balance of the IGM over a r ange of densities usi ng the 
CLOUDY software package (|Ferland et al.l ll99S). We 
treat the absorbing material as a plane-parallel gas slab, 
and CLOUDY calculates the ionization balance for each 
element, from which we extract the ionization fractions 
fo vi, fc iv, and /h i (Shown in Figure0J. These are com- 
bined to calculate the second term in Equation ^ along 
a grid of H I points. The exact ionization correction 
for each absorption line in the sample is then obtained 
by interpolating its H I column within the grid, and we 
combine these with the No vi, Nq iv, and Vjj i measure- 
ments from the data to calculate a value or 3cr upper 
limit for [O/H] or [C/H]. 

3.3. Metallicites of individual lines, and trends with 

In Figures[3and|ni we show the metallicity or its upper 
limit for each line in the O vi and C iv samples, plotted 
against H I column density. In this plot and those that 
follow, we have assumed the HM1.80, log J912 = —21.5 
fiducial spectrum for the UV background. The top panel 
includes all lines in each sample, and in the bottom two 
panels we have plotted detections and non-detections 
separately for viewing clarity. The dashed horizontal 
lines are drawn at an abundance of [O/H] = —2.5 to 
guide the eye, and are not fits to the data. 

Looking first at the detections, we see that over a broad 
range in H 1 density there is little change in the aver- 
age metal abundance. At the highest densities there is a 
clear trend toward increasing metallicity in the O vi sam- 
ple, with no corresponding change in the C iv distribu- 
tion. As discussed above, this could be caused by lo- 
cal enrichment, but is also likely due in part to model 
errors. In particular, at high densities (Nh : > 10 15 5 ) 
many of the systems will be multiphase absorbers with 
hot c ollisionally ionized O vi and cooler, photoionized 
C iv ( :Simcoe et al.l 12002]) . Thus at these densities the 
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Fig. 5. — Scatter plot of individual metlalicity measurements 
for all O VI lines in the Nh i > 10 13 6 sample. In the bottom 
two panels the detections and non-detections are shown sepcratcly 
for clarity. The dotted line drawn at [0/H]=-2.5 is not a fit to 
the data. The VI detections exhibit a fairly constant metallicity 
with decreasing density. The slanted trend in the upper limits is 
purely due to the detection threshold of the data. In the most 
sensitive region between 10 14 < TVh i < 10 15 , there is more scatter 
in the upper limits toward low metallicity, revealing the presence 
of a population of low metallicity systems. 



O vi and C iv might not be expected to follow each 
other too closely, and our photoionization models may 
break down. Nonetheless, at lower densities where the 
models are most accurate and where most of the lines 
are found, the metallicities of the detected systems are 
fairly uniform. To the limits of the survey (p/p ~ 1.6) 
we have detected no strong trend of decreasing metallicity 
toward the more tenuous regions of the IGM. 

This statement is based only upon the detected sys- 
tems; the upper limits in the bottom panel reveal a differ- 
ent central result: that a number of systems have metal- 
licities significantly below the trend outlined by the de- 
tected systems. This is most easily seen in the 10 14 ' 5 < 
Nh i < 10 15 range where our measurements can best 
sample the scatter both above and below the mean. 
In this regime most of the detected points fall near 
[O/H] ~ —2.5, but there are many non-detections whose 
3cr upper limits are an order of magnitude lower, with lit- 
tle or no corresponding scatter at high metallicity (this 
is perhaps best seen in the top panel of Figure [5j) . This 
pattern is likely to continue at lower densities, though we 
cannot test this hypothesis with the present data due to 
its limited signal-to-noise ratio (seen as a decrease in the 
sensitivity of the upper limits at low Nh i). 

It could be that some of our lowest limits are caused 
by an occasional failure of the density and/or ionization 
model developed above. A low metallicity results when 
we assume that much of the oxygen is in the O VI state. If 
the gas is more highly ionized (e.g. through shock heat- 
ing) or less ionized (e.g. if Equation [21 underestimates 
ur) our [O/H] estimate may be erroneously low. This 
effect should not arise from purely intrinsic scatter about 
the relation in Equation [21 in simulations the scatter's 
amplitude is only ~ ±0.05 dex in nn , which translates to 
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Fig. 6. — Scatter plot of individual metallicity measurements for 
all C IV lines in the Nh i > 10 14f) sample. Labels are as in Figure 
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~ ±0.03 dex in [O/H]. However, we cannot discount the 
possibility of the occasional pathological case that breaks 
the model. By studying a large sample of lines we hope 
to minimize the effects of individual strange systems. 

In Figure [7| we show a collection of four low metal- 
licity systems, to give a sense of the data quality used 
for our measurements. In each case, the H I profile is 
shown at bottom, along with the C IV 1549A transition 
and one or both components of O vi . We also show 
with smooth curves the predicted strength of the metal 
absorption features for [0/H] = [C/H]=-2.5 and our fidu- 
cial ionizing background spectrum. Two model curves 
are shown; the narrower assumes thermal line broaden- 
ing and the wider assumes pure turbulence. In each of 
these systems C iv should be marginally detectable, and 
O vi would have been clearly seen. For comparison, in 
Figures [8] and we show examples from the bd and dd 
samples (respectively), along with their best-fit absorp- 
tion line models. 

The qualitative results of this section are not strongly 
sensitive to the choice of ionizing background spectrum, 
though there are some slight differences between the 
models which are discussed further in Section EDfl Harder 
UV backgrounds lead to a slightly lower average [O/H] 
for the detected systems (by 0.1-0.2 dex), but there is 
still no strong trend of metallicity with H I column den- 
sity. The metallicity-density relation is affected by the 
strength of the soft X-Ray background, in the sense that 
fewer X-Rays (i.e. a softer overall background spectrum) 
would introduce a correlation of decreasing metallicity 
with decreasing density (See Appendix B for a more de- 
tailed discussion of this effect). The upper limits on 
[O /H] for the lowest metallicity systems are not affected 
by changes in the X-Ray background, but harder UV 
spectra can reduce these limits from [O/H] < —3.15 to 
[O/H] < -3.4. 

In physical terms, the H I densities probed by the 
survey extend roughly to the boundary between the 
filaments and voids seen in cosmological simulations. 
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Fig. 7. — A montage of low metallicity "nci" systems. The 
H I Ly-ct transition is shown in the bottom of each panel, along 
with the C IV 1548A transition and one or both O VI transitions. 
The continuous curves show the predicted strength of O VI and 
C IV if [0/H] = [C/H]=-2.5. The narrow profile represents thermal 
broadening, and the broad profile represents turbulent broadening. 
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Fig. 8. — A montage of detected bd systems, shown with best-fit 
model curves for the detected component of O VI . 



Within the filaments, where most baryons are found, 
the enrichment pattern does not appear to vary with 
overdensity. If strong overall metallicity gradients do 
exist in the IGM, they must occur as the transition is 
made into the voids which fill most of the cosmic volume. 
In the moderate density regions where our O vi mea- 
surements are most sensitive, we detect significant num- 
bers of metal-poor forest lines {[O/H] < —3.2), and we 
hypothesize that analogous low metallicity systems are 
likely to exist at lower densities. In the subsequent sec- 
tions, we use the individual measurements and upper 
limits to better quantify the relative frequency of the 
enriched versus metal-poor systems in our sample. 

3.4. Survival Analysis 

To translate the measurements of individual lines into a 
statistical distribution of metallicity, a proper accounting 
must be made both of detections and of nondetections 
in the total sample. This problem is well suited to the 
methods of survival analysis, a branch of statistics that 
deals with "censored" datasets, which contain mixtures 
of measurements and upper limits. 

The most general single variate survival statis- 
tic is the Kaplan-Meier product limit, which pro- 
vides a non-parametric maximum likelihood estimate 
of a distribution directly from observed data. The 
Kaplan-Meier statistic and other survival methods have 
been discuss ed extensively in the as tr onomical liter- 
ature (e.g.. iFeigelson k Nelson! Il985t iSchmitti 119851 
iWardle fc KnaprJl986j) . We follow these examples, mod- 







Fig. 9. — A montage of detected dd systems, shown with best-fit 
model curves determined using VPFIT. 
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ifying notation slightly to suit our particular application. 
We shall use the term "measurement" to describe the 
combined set of detections plus upper limits; "detection" 
and "upper limit" will be used when the specific context 
requires. We also use the notation Z trno to represent 
the actual metallicity of a line, so that for detections 
Ztmo = while for upper limits its value is unknown 
beyond that Z truc < Z%. 

Consider a sample of lines for which a single variate - 
the metallicity Z - has either been detected or an upper 
limit has been determined. We assume that the total 
set of measurements Z,-, % = 1,2, 3. ..TV has been sorted in 
order of increasing metallicity such that every Zi < Zi+i 
(where multiple measurements result in the same value, 
censored data points are considered smaller). We wish 
to use this sample to estimate the cumulative probability 
distribution P(Z > Zi): the fraction of Ly-a forest lines 
above any given metallicity threshold. 

We begin at the maximum of the distribution, since 
for all Z + > Zjv, every measurement in the sample is 
at lower metallicity, hence we estimate P[Z < Z + ) = 1. 
According to standard convention, then 



P(Z >Z + )=1- P{Z < Z+) = 0. 



(3) 



Subsequent values of P{Z > Zi) at are constructed at 
each i by stepping downwards from the iVth measure- 
ment using the rules of conditional probability. As the 
first such example, we calculate P[Z > Zn): 

P(Z>Z n ) = 1-P(Z<Z n ) (4) 
= l-P [z <z N \z<z+]-P(Z <Z+) 
= 1 _ P[z<z N \z<z+] 
where P[z<z N \z<z+] denotes the conditional probability 
that Z < Z N given that Z < Z + . It follows that P{Z > 
Zn-i) becomes 

P(Z > Zn-i) = 1 - P[z<z N -!\z<z N ] 



■P 



or for arbitrary i, 



z<z N \z<z+ 



N 



yP{Z<Z+), (5) 



P(Z>Zi) = l-l[P [z<Zjlz<z . +1 



(6) 



The conditional probabilities are estimated by count- 
ing measurements from the ranked sample as follows. For 
illustration, we first assume all data points in the sample 
are detections (i.e. no upper limits). In this event the 
conditional probability at each i would be 



P 



z<z j \z<z j+1 ] 



# detections with Z < Zj 
# detections with Z < Zj + \ 



(7) 



n(z<z j+1 ) -n (z=Zj) 
n (z<z J+1 ) 



(no upper limits). 



In a sample containing upper limits, the quantities 
n (z<z j+1 ) an d n( Z = z .) are not uniquely known, since an 
upper limit Zk measured at k > j + 1 could have a true 



metallicity Zk,tmc < Zj < Zj+i 



or Zj < Zi 



j ^ ^fc.truc < Zj+l, 

or Zj <C Zj-\-i ^ Zk^tme . In other words, an upper limit 
Zk with k > j does not provide useful information about 
the relative weights nt z<z . +1 \ and ni z — Z -\ because the 
relation between Zk^tmc, Zj and Zj + \ is ambiguous. The 



Kaplan-Meier method circumvents this ambiguity by ef- 
fectively ignoring upper limits with k > j + 1 in con- 
structing P[ Z<Zj \z<z j+1 ] > but retaining the useful infor- 
mation from upper limits with fc < j. Accordingly, the 
conditional probability P[ Z < Z] \z<z j+1 ] is expressed as 



measurements which must have Z truc < Zj 
j 



(8) 



In practice, this probability can take on one of two 
values, depending on whether Zj is a detection or an 
upper limit. If Zj is an upper limit, then all of the lines 
up to and including line j must have true metallicities 



less than Zj, hence P\< 



j/j 



If Zj 



j 3 , iioii ^ c * [ Z < Z A Z <Z j+ i\ — J/J — " 

is a detection, then the conditional probability resembes 
equation 7: 



P 



Z<Zj\Z<Zj 



1 

i-"(z=z j ) 



Zj = upper limit 
Zj = detection 



(9) 



Thus the Kaplan-Meier product limit estimate of P(Z > 
Zi), specified by Equations and [5J is a piecewise step 
function, which jumps at the Z values of detections and 
remains constant through upper limits. 

For the Kaplan-Meier estimator to be valid, two as- 
sumptions about the distribution of upper limits must 
be satisfied. First, the upper limits must be independent 
of one another. In our case, this is clearly true when 
upper limits are measured for lines that are not blends 
of two O vi or C iv components. This applies to vir- 
tually the entire sample (In some cases, detections are 
measured in regions with blends, but never upper lim- 
its). Second, the probability that a measurement will 
be censored should not correlate with the measurement 
value itself (i.e. the censoring should be random). Our 
data may not strictly meet this criterion, as very high 
metallicity points are not likely to be censored. Yet the 
sample was selected based on H I column density - not 
metallicity - and since [O/H] and [C/H] do not corre- 
late with H I column density (see FigurejSj this selection 
method should randomize the censoring. Furthermore, 
since the measurements are made in data with a range of 
signal-to-noise ratio and blending from the Ly-a forest, 
the censoring will be further randomized by variations in 
the data quality. Thus, while the censoring pattern may 
not be random at the outset, several factors conspire to 
make it approximately random in the end. This can be 
seen in Figure [SJ where upper limits are mixed in with 
measurements over much of the sample. 

We have calculated the Kaplan-Meier distribution for 
our sample of metallicities using the publicly avail- 
able ASURV VI. 2 softw are suite l|Lavallev et all 119921 
iFeigelson fc NelsonlUQS.^ . The raw output for O VI is 
shown in Figure ITU1 We ran the calculations twice, once 
using our set of upper limit measurements that assume 
thermal line broadening (solid line), and once using the 
limits measured assuming turbulent broadening (dashed 
line). Both curves have included the entire dd and bd 
O vi samples, which will contain a (possibly large) num- 
ber of false positive identifications. As expected, the 
curve for the turbulent sample is higher than that of the 
thermal sample. This is caused both by the turbulent 
upper limits being less sensitive than the thermal upper 
limits, and by the introduction of more false positive IDs 
from statistical fluctuations in the continuum. Because 
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Fig. 10. — Uncorrected cumulative distribution function of 
[O/H] for the O VI sample. Both the turbulent and thermal line 
broadening cases are shown. These curves represent upper bounds 
on the true distribution, due to contamination from false positive 
O VI identifications. 



of the level of contamination from false O vi IDs, the 
raw Kaplan-Meier distribution represents only an upper 
bound on the true metallicity distribution. 

Another point of interest is that when the analysis is 
repeated using only detected lines (i.e. the upper lim- 
its are ignored) the median metallicity increases by a 
factor of ~ 2, and the enrichment pattern is similar to 
what has be en inferred in earlier studies of intergalactic 
enrichment ([Ellison et alJl2000t iCowie k. SongaTlalll998l: 
Rauch et al.lll997aHSongaila fc Cowielll996UCowie et all 
1995bft . The median [O/H] in this case is very near —2.5, 
with scatter of 0.75 — 1.0 dcx FWHM, and the cumu- 
lative probability distribution is > 90% by [O/H]=-3.0 
as though there existed a metallicity "floor". A proper 
treatment of upper limits is therefore a critical compo- 
nent of the analysis, and will have important implications 
for the detectability of a universal, zero point metallicity 
for the IGM. 

3.5. Corrections for False Positive vi Identifications 

Using the tools developed above, we are in position to 
calculate correction factors that remove the effect of false 
positive O vi IDs from the data. We accomplish this 
using Monte Carlo techniques, where pairs of H I and 
O vi lines are generated from a predetermined distribu- 
tion of [O/H] and added to the data. We then use iden- 
tical measurement methods and survival analyses to test 
our ability to recover the input metallicity distribution. 

Our philosophy has been to avoid making corrections 
directly to the output of the Kaplan-Meier estimation. 
Instead, we note that the KM distribution is constructed 
from a list of metallicity detections and upper limits, 
where an unknown fraction of the detections are actu- 
ally false positives. For a given detection at risk of being 
a false positive (e.g., a line in the bd sample), the column 
density that is measured is unquestionably at least an 
upper limit on the actual O vi strength. Hence for some 
unknown fraction of the sample, our metallicity "detec- 
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Fig. 11. — Illustration of the Monte Carlo method to determine 
correction factors for false positive O VI IDs in the data. The 
thick solid line (smoothed in the right panel) indicates the actual 
distribution of metallicities put into the simulated spectra. In the 
left panel, we show the raw Kaplan-Meier distributions resulting 
from measurements of the simulated lines, in the presence of noise 
and Ly-a forest blending. The corrected curves in the right panel 
are created by randomly re-assigning 57% of the thermal bd systems 
as upper limits (e.g. because of Ly-o forest contamination). An 
87% correction was required when turbulent line broadening was 
assumed. Five realizations of the censoring are shown for each case 
of the assumed line broadening, to provide a sense of the random 
scatter in the distributions after our correction procedure. 



tions" are actually upper limits on the true metallicity. 
Our approach has been to use Monte Carlo simulations 
to determine this fraction of false positives. Then, we 
create several new "real" datasets from the actual data, 
where different random combinations but the same frac- 
tion of bd detections are demoted to upper limits. The 
KM distribution is calculated for each of these adjusted 
samples, and the results are averaged together to obtain 
our final estimate of P(Z > Zj). 

To produce the Monte Carlo data set, we began with 
the observed N-r i and &h i measurements for each quasar 
sightline, but reassigned a random redshift to each ab- 
sorption system. The redshifts were distributed accord- 
ing to recent measurements of dN/dz(z) in t he Ly-a for- 
est in our redshift range ijKim eTaI]l20uH) . For each 
H I line we then generated a corresponding O vi doublet 
at the same redshift. To determine the &o vi values, we 
first assembled the distribution of n, = 6hi/£>Ovi from 
observed dd systems, where our O vi identifications are 
most secure. This ratio varies from r^ — 1 for turbu- 
lent broadening to r^ = 4 for thermal broadening, with 
P{ r b) ~ 0.5 — 0.07r (determined empirically from the dd 
sample). No variation in P (?";,) was observed with H I col- 
umn density. Values of r^ were drawn at random from 
the distribution and used to calculate bo vi for each line 
in the sample based upon its H I linewidth. 

O vi column densities were calculated for each line 
by inverting Equation ^ to find No vi given [O /H] and 
Ah i • The ionization correction term as a function of 
H I strength was calculated with the same HM1.80 ion- 
izing background spectrum used for the survey data (see 
Figure |2l . The actual metallicities for each line were 
drawn from an artificial, gaussian distribution with mean 
{[O/H]) = -2.8 and a = 0.75. This contrived metallic- 
ity distribution was chosen after several iterations with 
different backgrounds and metallicity prescriptions. We 
found that the correction factors did not change sig- 
nificantly when other reasonable backgrounds or input 
metallicity distributions were used. Our final background 
and contrived abundance distribution were chosen simply 
to resemble the patterns seen in the actual data. 
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We added the simulated set of H I and VI lines to the 
actual spectra for all 7 quasar sightlines. This method 
should best replicate the actual sources of error and con- 
tamination in the real sample, including continuum fluc- 
tuations/errors, varying signal-to-noise ratio, Ly-a forest 
contamination, and instrumental or other more insidious 
effects. To minimize the possibility of artificial lines be- 
ing added where real O vi was already present, none 
of the lines were placed at redshifts within 200 km/s of 
a known line with TVh i > 10 13 5 . The metallicity and 
O vi strength were kept hidden from the user to avoid 
bias in the detection process. 

Finally, we searched the spectra for O vi absorption 
at the redshift of each artificial H I line in exactly the 
same manner that was used for the true data. We mea- 
sured O vi column densities or upper limits for both the 
thermal and turbulent broadening cases at each redshift. 
These measurements were translated into metallicities, 
and a Kaplan-Meier estimate of their distribution func- 
tion was constructed. These probability distributions are 
shown in the left panel of Figure ITT1 The thick solid line 
indicates the true metallicity distribution that was input 
to the Monte Carlo line generator. The solid and dashed 
histograms represent the measured distributions assum- 
ing thermal and turbulent line broadening, respectively. 

Clearly the measured distributions systematically over- 
estimate the input distribution, presumably either be- 
cause O vi column densities have been overestimated due 
to Ly-a forest blends, or because false positive interlop- 
ers have contaminated the sample. We cannot determine 
which particular detections are false positives, but the 
vast majority will be from the bd sample. To bring the 
measured distributions in Figure ITT1 into agreement with 
the input distribution, we have demoted varying fractions 
fd of the bd detections to upper limits and recalculated 
the Kaplan-Meier distribution. The specific set of points 
to be artifically censored was chosen randomly (for sev- 
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Fig. 12. — Contamination corrected version of the [O/H] dis- 
tribution function for thermal and turbulent broadening, where we 
have applied the censoring factors determined from Monte Carlo 
testing. Again, five censoring combinations are shown for each 
broadening case, as an indication of the residual scatter from the 
contamination correction. 
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Fig. 13. — Comparison of the Kaplan-Meier distributions for oxy- 
gen and carbon. We have assumed our fiducial HM1.80 model for 
the UV background, which was chosen to produce the best possible 
match between these two curves (i.e., we have assumed [O/C] = 0). 
For harder models of the background, C IV is overproduced rela- 
tive to O VI . This model produces an excellent statistical match 
between the two distributions. 



eral realizations), and fd was varied until the resultant 
Kaplan-Meier curves produced the closest possible match 
to the input distribution in a least-squares sense. The ex- 
ercise was repeated for both the thermal and turbulent 
distributions; the final corrected Monte Carlo measure- 
ments are shown in the right panel of Figure 1111 Five 
different lines apiece are shown for the corrected ther- 
mal and turbulent distributions, corresponding to five 
realized combinations of the random censoring. It was 
not surprising to find that the false positive rate was 
quite high. For the thermal sample, the optimal demo- 
tion fraction was fd — 57%, whereas for the turbulent 
sample fd — 87%. 

3.6. Final Estimates of the Metallicity Distribution 

Once the fd factors had been determined, we applied 
the same corrections to the actual survey data. The re- 
sults are shown in Figure 1121 Encouragingly, the cor- 
rected thermal and turbulent distribution functions are 
in quite close agreement. Comparison of Figures llOl 1111 
and^Jindicates that the magnitude and sense of our con- 
tamination corrections are similar for the Monte Carlo 
and real samples. This increases our confidence that 
the measurement and correction techniques are sound, 
at least for similar intrinsic distributions of [O/H]. As a 
test, we used the same correction factors to see whether 
we could recover a quite different artificial metallicity 
distribution - a step function with P([0/H] < —2.5) = 1 
and P{[0/H] > -2.5) = 0. Again, the Kaplan-Meier 
method performed well. A strong discontinuity was re- 
covered at [O/H] w —2.45, though it was broadened by 
- 0.5 dex. 

In Figure ED we show the Kaplan-Meier distributions 
of [O/H] and [C/H] together. For this figure only, we 
have restricted the O vi systems to have N-fti > 10 14 , 
for fair comparison with the C IV sample. These curves 
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were produced assuming a HM1.80 UV background spec- 
trum normalized to logJ9i2 = —21.5, yielding a 95% 
ionization of cosmic He n at z ~ 2.5. This choice pro- 
duced an excellent statistical match between the [O/H] 
and [C/H] curves, and it is used as our fiducial model 
throughout the paper. To arrive at this choice of the 
background spectrum, we tested a range of models with 
different spectral slopes and normalizations to determine 
which form produced the best least-squares match be- 
tween the [O/H] and [C/H] distributions. The slopes 
considered included HM1.50, HM1.55, HM1.80, and a 
soft spectrum of HM1.80 quasar light mixed with radi- 
ation from galaxies. In the QSO+galaxy spectrum, the 
escape fraction of Lyman limit photons was set at 10%. 
The normalization of each spectrum was varied between 
-21.6 < log J 912 < -21.2. 

The only spectrum besides the fiducial model that 
produced satisfactory results was the HM1.80+galaxy 
model, normalized to log J912 = —21.2 (implying 93% 
ionization of He II ). In this scenario the [O/H] and 
[C /H] distributions were both shifted to the left of those 
in Figure EH by ~ 0.25 dex. The lower metallicities can 
be attributed to extra H 1 ionizing photons supplied by 
the stars. These boost the hydrogen ionization correc- 
tion but do not strongly affect the metal lines, which 
are governed by harder UV radiation that can only be 
supplied by quasars. 

In choosing the background spectrum to match [O/H] 
with [C/H], we have implicitly assu med that \C/Q] = 
in the low density IGM, and that the lCrevesse fc Sauvall 
( 1998) solar abundances are accurate. As discussed in 
Section |3~T1 several changes have been suggested for the 
solar abundances in recent years. Substitution of the 
lAllende Prieto et alJ l|2002|) carbon and oxygen values 
should not affect our choice of UV background, since 
the zero points for these elements were revi s ed by nearly 
equal amounts. However, on the iHolwegerl l|2001f) scale, 
the solar (C/O) ratio is increased by 0.16 dex. More- 
over, observations of metal-poor galactic halo stars in- 
dicate that [C/O] ~ —0.5 may be a more appropriate 
benchmark for l ow metallicity environ ments like the Ly- 
a forest clouds ijAkerman e t aL 2003(). 

To gauge the importance of these effects, we used our 
full suite of UV background models to compare the dis- 
tributions of [O/H] vs. [C/H] + 0.5 (for the metal- 
poor star scenario), and [O/H] + 0.09 vs. [C/H] - 0.07 
(to mimic Holweger's solar abundances). Generally, we 
found that harder UV backgrounds resulted in higher 
[C/O] ratios. Accordingly, the Holweger abundances fa- 
vored a HM1.55 background but ultimately yielded simi- 
lar intrinsic [O /H] and [C /H] distributions as our fiducial 
model. For all normalizations, the matches between the 
Holweger curves were statistically inferior to those of our 
fiducial model. 

For the [C/O] = —0.5 case, a very soft background 
spectrum was needed to suppress the measured car- 
bon metallicities relative to oxygen. In fact, the 
HM1.80+galaxy spectrum was the only viable model 
for any choice of solar abundances. Normalizations in 
the range —21.6 < log Jgi2 < —21.4 all produced su- 
perb matches between the distributions of [O/H] and 
[C/H] +0.5. Thus, to produce an intergalactic [C/O] ra- 
tio resembling that of metal-poor halo stars, a soft UV 
background spectrum is required. In this scenario, the 
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Fig. 14. — The differential distribution function of [O/H], de- 
rived from the Kaplan-Meier estimator. The dashed vertical line 
represents the approximate detection limit of the survey. The dot- 
ted curve is a unit area gaussian, with mean {[O/H]) = —2.85 and 
a = 0.75, a reasonable analytic estimate of the IGM metallicity to 
the limits of our data. 



median carbon abundance is reduced to [C/H] = —3.1 
and the median oxygen abundance remains near [O/H] = 
-2.7. 

Figure El (which includes all O VI lines) depicts our 
best estimate of the metallicity distribution function of 
the Ly-a forest, using our fiducial choice of background 
spectrum. Even at the lowest observed values, the cu- 
mulative distribution does not reach 100%. Roughly 
70% of the forest has been enriched to [O/H] > —3.5. 
Thus we have detected neither a metallicity "floor" in the 
IGM, nor an abrupt transition toward chemically pristine 
gas at low densities. An abundance floor could exist at 
[O/H] < —3.5, or some of the remaining 30% of lines 
may be nearly metal-free. 

For our fiducial ionizing background model, we find a 
median intergalactic abundance of [C,0/H] = —2.81, 
quite similar to prior studies suggest i ng —2.5 < 
[O/H ] < -2.0 (ICowie fc Songailal Il998t Ellison et all 
2000), though a factor of 2 — 4 lower. The mean is for- 
mally unconstrained, as our lowest measurements are up- 
per limits. Some further insight may be gained by exam- 
ining the differential abundance distribution, which we 
have also cal culated using ASURV us ing the algorithms 
presented in iWardle fc Knand l)1986l) . The differential 
O VI distribution is shown in Figure El and is quite sim- 
ilar to the C IV distribution which is not shown. The 
vertical dotted line indicates the weakest detection in 
the sample (though there are some lower upper limits). 
The lines appear to be drawn from a single, fairly uni- 
form distribution in [O/H] that peaks near the median 
metallicity. We cannot probe the lower half of the dis- 
tribution as it lies below our sensitivity threshold, but 
we know that there must be a substantial dropoff - since 
we have already accounted for ~ 70% of the lines, the 
integrated area to the left of the sample cutoff can be 
at most ~ 30%. The observed part of the distribution 
resembles a unity normalized gaussian PDF, with mean 
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Fig. 15. — Comparison of the raw (uncorrected) [O/H] distribu- 
tion seen in the survey data (thick lines) with the raw distribution 
produced by adding simulated systems to the spectra with our best 
estimate of the differential [O/H] distribution (thin lines). Our 
simple lognormal metallicity model mimics the real spectra over a 
broad range of [O/H]. 



{[C,0/H]) = -2.85, and a = 0.75 dex. In terms of 
Z/Zq, the distribution of metallicity would be lognor- 
mal. 

To illustrate how our model corresponds to what is 
seen in the actual spectra, we show again in Figure 1151 
the raw Kaplan-Meier O vi distribution, which has not 
been corrected for false positive identifications. Both 
the thermal and turbulence broadened measurements arc 
shown, as the thick dotted and solid lines, respectively. 
On the same graph, we have plotted the raw Kaplan- 
Meier distributions of a simulated dataset with the log- 
normal metallicity distribution described above (shown 
as matching thin curves). A simple, single population 
model for the metallicity distribution (along with a uni- 
form ionizing background) reproduces the actual survey 
results extremely well, over the entire measureable range. 

3.7. Statistical tests for correlation between iVu / and 

[O/H] 

The concepts of survival analysis can also be extended 
to treat bivariate data. This allows us to quantify the 
trend of [O /H] with TVh i , even though the distribution 
of [O/H] is censored. 

Several tests have been devised to rate the significance 
of correlations in censored datasets, including the Cox 
proportional hazard method, the generalized Kendall's r 
statistic, and generalized Spearman's p. These tests are 
implemented in ASURV along with a number of tech- 
niques for linear regression of censored bivariate data. 
We have run our measurements through each of the cor- 
relation tests, and performed a Buckley-James regres- 
sion to determine the slope of any correlations between 
[O/H] and JVhi. We used the three different prescrip- 
tions for the ionizing background which produced plausi- 
ble results in the previous section: our "fiducial" HM1.80 
quasar spectrum, the HM1.80+galaxy spectrum normal- 
ized to log J912 = —21.2 (which produces [C/O] = but 
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Fig. 16. — Regression analysis to quantify trends of [O/H] as 
a function of TVjj i- Points with vertical bars represent the me- 
dian [O/H] and its interquartile range within bins of 0.25 dcx in 
Nn 1, while dotted lines represent best- fit Buckley- James regression 
models over the rang e 10 l3 ' 6 <jV H i< 10 15 . For our three favored 
models of the UV background, we find no evidence of a strong cor- 
relation between metallicity and overdensity below Nm ~ 10 15 . 
At higher densities we detect a rise in the oxygen abundance, sub- 
ject to caveats described in the text. 



with lower overall metallicities), and the HM1.80+galaxy 
spectrum normalized to logJa.12 = —21.5 (required if 
[C/O] = -0.5). 

The results are illustrated in Figure El To produce 
the data points shown in this plot, we have divided the 
sample into bins of Nn 1 , and used the Kaplan-Meier 
statistic to determine the median [O/H] and its inter- 
quartile range for each bin. The medians are shown as 
solid dots, and the quartiles are shown as terminated 
vertical bars. For some bins, our measurements were not 
sensitive enough to reach the 75th (lowest metallicity) 
percentile; these bins are indicated with arrows. When 
the distribution function does not even reach the median 
for a bin, its lowest reliable value (typically around the 
40th percentile) is indicated with a horizontal dash. For 
each plot, we also show the best-fit relation between Nn 1 
and [O/H], determined using Buckley- James regression 
over the range 10 13 <3 < 7V H 1 < 10 15 . Despite the Fig- 
ure's appearance, the line fit is not found though \ 2 min- 
imization of the binned data. Its solution more closely re- 
sembles a maximum-likelihood fit to the unbinned points 
and upper limits. 

From the binned data, it is clear that with the Nn 1 > 
10 15 systems included, a significant correlation is present 
for any choice of UV background. This is confirmed by 
the statistical tests and regression. However, we have ar- 
gued that our metallicities in this range are less reliable. 
Above ./Vh i = 1 — 2 x 10 15 , a large fraction of systems 
no longer conform to our simp le, quiescent, photoion- 
ized model of the Ly-a forest. In lSimco'e etafl (2002) we 
descibed how these absorbers contain mixtures of hot, 
probably collisionally ionized O vi , together with cooler 
photoionized C iv and lower ionization species. They 
may be the remnants of galactic winds, or shocked parcels 
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of unusually metal-rich gas in assembling haloes or larger 
structures. These environments require a more sophisti- 
cated physical model than our simple density and pho- 
toionization scalings. Accordingly, we focus on systems 
with lower densities for our regressions. 

In the range below iVm = 10 15 , there is no obvious 
signature of a metallicity gradient in the medians and 
quartiles, and in fact our regression fits yielded zero slope 
for two of our three plausible background models. For 
the fiducial UV background model (top panel of Fig- 
ure EJ), we found a 50 — 75% probability that no cor- 
relation existed using the Cox method and Kendall's r. 
Spearman's p yielded a 88% probability of no correlation. 
The slope of the regression line was d[0 / H] / d log N^i i = 
0.05 ± 0.13, statistically consistent with zero. The 
HM1.80+galaxy model normalized to log Jgi2 = —21.2 
yielded similar results, though in this case, the best fit 
metallicity decreased mildly with increasing TVh i (wih 
slope —0.07 ± 0.13). This represents a physically un- 
likely scenario. There is evidence of a metallicity-density 
correlation at the ~ 2.3cr level if we use our softest back- 
ground, the HM1.80+galaxies with log J912 = —21.5 (the 
[C/O]=-0.5 condition). For this UV background the vari- 
ous statistical tests give a correlation probability between 
85 - 95%, with d[0/H]/d\ogN Hl = 0.334 ±0.14. 

Our statistical tests support the qualitative results of 
Section 13.31 at low densities we find no strong gradi- 
ents in the intergalactic metallicity, though for the softest 
model of the UV background that we have tested, there 
is some evidence of a gradual trend. Above TVh i > 10 15 
(p/p ~ 11) there is a clear rise in the median [O/H], 
which may mean that a boundary has been crossed into 
environments with strong local enrichment. A corre- 
sponding jump is not seen in [C/H], perhaps indicating 
that the stronger absorbers are physically more complex 
or multiphased. In the quiescent, rarefied IGM, the av- 
erage abundance remains fairly constant with density to 
the limits of our survey (p/p ~ 1.6). At the very least 
we have not detected a dramatic boundary below which 
the IGM is chemically pristine. 

3.8. Comparison with Recent Results from Pixel 
Statistics 

A r ecently published study of C iv by iSchave et al.1 
(2003, hereafter S03) compares observed pixel statistics 
with numerically simulated spectra to determine the me- 
dian intergalactic carbon abundance and trend of [C/H] 
with overdensity. Our results are consistent with those 
of S03 where the two studies can be compared, and the 
agreement is particularly good regarding the shape of 
the metallicity distribution function. S03 emphasize the 
presence of a metallicity-density correlation; in our inter- 
pretation of the data presented above, plausible models 
of the UV background can lead to positive, slightly neg- 
ative, or null correlation results, so we do not claim to 
see such compelling evidence of a gradient. 

S03 favor an ionizing spectrum similar to our 
HM1.80+galaxies prescription, normalized to log J912 = 
—21.5 (at z ~ 2.5). This is the form we used to pro- 
duce [C/O] = -0.5 in the IGM, so although S03 do 
not discuss oxygen abundances we expect that their 
[O /H] distribution would be higher than the correspond- 
ing [C/H]. Using this model we found a median car- 
bon abundance of [C/H] = —3.1, and a median oxy- 



gen abundance of [O/H] = —2.7. In S03, the me- 
dian carbon abundance increases from [C/H] = —3.84 
to -3.19 over the range 10 136 < N K , < 10 15 , with 
d[C/H]/d\og V H , « 0-43±g:g| (note a factor of § conver- 
sion from d[C/H]/dlog8 to d[C / H] jd log 7V"h i, see Equa- 
tion^. Our measurements are slightly higher than S03's 
over this entire range, although they are comparable near 
V H 1 ~ 10 15 . Our slope for the [O/H] versus V H 1 fit us- 
ing this background is somewhat lower than S03's (0.33 
versus 0.43), but still within the la errors. Both we 
and S03 find lower [C/H] values than earlie r studies (e.g. 
iSongaila fc Cowielll99fiHEllison et aTll2000ft . but this dif- 
ference is primarily due to our use of a softer spectrum 
for the UV background. 

S03 have also tested their calculations using a back- 
ground spectrum similar to our fiducial model (their 
"Q" spectrum). For this case, they find a higher me- 
dian [C/H] ~ —2.8 to —2.9, and little or no slope in 
the metallicity-density relation, in good agreement with 
our results. Moreover, both methods derive a lognormal 
distribution of metallicities with a ~ 0.75 dex. 

For the regression analysis, we have taken a slightly 
different approach from S03 by dividing our sample at 
Nhi = 10 15 , reflecting our misgivings about the ion- 
ization modeling of the stronger systems. S03 do not 
split their C iv sample along these lines, though J. 
Schaye (private communication) has indicated that their 
results do not change substantially if the domain is like- 
wise restricted. Interestingly, if we perform a [C/H] 
regression using the HM1.80+galaxy UV background 
with logJgi2 = —21.5, and all of our C iv systems 
at all densities, we obtain a very similar slope to S03: 
d[C/H] /d log Nn , w 0.412±0.12, though the overall nor- 
malization of our [C/H] curve is higher by 0.26 dex. The 
errors on our [C/H] regression become extremely large 
if we only consider systems with JVh 1 < 10 15 , so it is 
difficult to compare our C IV results in this regime. We 
showed above that our [O /H] regression for this UV back- 
ground and density range produced a mild metallicity- 
density correlation. However, a fit for [O/H] including 
systems with all H 1 column densities produces a much 
steeper slope than S03, discrepant at the 3cr level. 

In the end, interpretations of a metallicity gradient are 
tied to assumptions about the shape of the UV back- 
ground and the range of Vh i appropriate to the regres- 
sion analysis. We have found three versions of the UV 
background that produce plausible relative distributions 
of [C/H] and [O/H]; for two of these there is no slope 
in the metallicity distribution, and for our third, softest 
model we see a mild slope. Where they can be com- 
pared, these results are fairly consistent with those of 
S03, who also find little or no slope for QSO dominated 
backgrounds, but a slightly larger metallicity gradient for 
QSO+galaxy backgrounds. In no case do we observe a 
dramatic decline in the metallicity that would indicate a 
chemical enrichment boundary in the IGM. For all mod- 
els, the [O/H] slope is much steeper if we include high 
column density systems, but in these cases the slope be- 
comes too severe to be consistent with our low density 
result, our [C/H] fit, or the [C/H] fit of S03. We inter- 
pret this as the onset of a more complicated ionization 
structure in the high density systems. 
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Fig. 17. — The mass fraction of intergalactic gas contained in 
each decade of H I column density for our line samples. The vertical 
dashed line indicates the low density cutoff of the O VI sample, 
which encompasses ~ 50% of Q g . 



3.9. Cosmic Mass / Volume Fractions Probed by the 

Survey 

We now present a simple method for recasting the ob- 
served fractions of absorption lines at different abun- 
dances into more physical terms. We are particularly 
interested in the mass and volume fractions of the uni- 
verse that are enriched to various levels. To estimate 
this distribution, we first recall that we have fit Voigt 
profiles to every H I line in the Ly-a forest. This was 
originally done to remove higher order H I transitions for 
the O vi search, but the line lists also provide detailed 
information on the column density distribution of H I in 
the exact redshift window of the survey. This informa- 
tion may be used to calculate the total mass fraction 
of baryons probed by the survey, and by extension the 
volume fraction. 

Our ma s s frac tion calculation follows the development 
of lSchavei ll200l . and we begin with his Equation 16: 



(10) 



Here, f(N H i,z) — d 2 J\f/dNn t dX represents the number of 
H I absorption lines per unit Nn i and per unit absorption 
pathlength 9 , and is calculated directly from the H I line 
lists. We have assumed (as in Schaye) that the gas is 
isothermal, and that the fraction of the total cosmic mass 
contained in gas is close to f2f,/fl m . Using Equation 1101 
we may infer the fraction of £l g contained in each decade 
ofJV Hl : 
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dlogiVH, dN Kl ^ KJV =.-W (11) 

This quantity is plotted in Figure 1171 with the points 
and errors taken directly from the H I distribution of 

9 The absorption pathlength is similar to a redshift interval, 
scaled to comoving units. In a flat universe, a QSO sightline rang- 
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the data. The vertical normalization of the plot is ar- 
bitrary, and the points are equally spaced in log A^h i ■ 
The distribution should be quite accurate over the range 
10 125 < iVn i < 10 15 , where distinct absorption lines are 
clearly visible in the data and line saturation is not se- 
vere. At higher Nn i there is an increased uncertainty 
in our column density measurements for some systems. 
Since the primary purpose of the line fits was to facili- 
tate the O vi search, the fit quality was not as uniformly 
high over all H I transitions for high density systems that 
did not have bearing on an O vi measurement. Further- 
more, the exact shape of the turnoff below A^h i ~ 10 13 
is the subject of some contention, as it results from a 
combination of real, physical effects and observational 
incompleteness. Though we may miss some amount of 
the mass in this region, cosmological simulations indicate 
that there should be a real and significant drop in the 
mass density contained in systems below ATh i ~ 10 125 . 
Whatever the case, the accuracy of the data is highest 
over the range which contains most of the mass. We shall 
show below that these uncertainties do not change our 
qualitative results. 
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The H I cutoff of the O VI sample (AT H , > 10 
shown in Figure El as a vertical dashed line. We wish 
to determine what fraction of the total gas mass lies to 
the right of this line. A simple inspection suggests that 
this fraction is close to one half, which turns out to be 
remarkably close to the true value. Since the data are 
equally spaced in the log, we may simply sum the points 
to the right of the line, and normalize by the sum of all 
points to determine what fraction of f2 s is probed by the 
survey. Depending on whether lines with Nn j > 10 17 5 
are included in the calculation (since they suffer from 
small number statistics), we find that the survey probes 
between 51 — 57% of the total £l g . Incompleteness at 
Ah i < 10 12 has little effect on the results, although in- 
completeness at 10 12 < A^h i < 10 12 5 could reduce our 
coverage to ~ 42% of il g if we have undercounted these 
lines by a factor of 3 (a fairly conservative estimate) . Ac- 
counting for a range of uncertainty in our measurement 
of the H I column density distribution, we estimate that 
our survey probes roughly 42 — 57% of the gas in the 
universe at z ~ 2.5 by mass, with the most likely value 
being close to 50%. 

In Figure 1181 we apply this information to determine 
the mass fraction of baryons that have been enriched to 
a given oxygen abundance, which we call the enriched 
mass function, or EMF. The two curves that bound the 
shaded region of the plot represent upper and lower lim- 
its on the EMF, derived using two different assumptions 
about the trend of [O/H] versus A^h i below our survey's 
H I threshold. If we define P([o/h].Nh i) as the differential 
probability that a line with A^h i will possess an oxygen 
abundance [O/H], the EMF is defined as 

EMF = / masB( > [0 /if]) (12) 

_ J[o/g] Ho P([o/m,N* i) dT^krA 10 ^ N *^ d ( [#] ) 
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To produce the upper bound, we suppose that the inter- 
galactic enrichment pattern seen in our data continues to 
arbitrarily low densities, filling the entire volume of the 
universe. In this case, P([o/h],n h i) is independent of A^h i 
(See Figure and the integrals over log Nh i and [O/H] 
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Fig. 18. — Upper and lower bounds on the enriched mass func- 
tion of the IGM, calculated from the O VI observations. 



may be separated. The ratio of the density integrals re- 
duces to unity, and we are left with the expression 

O 
H 



fmaee(>[0/H]) < / P([0 / H},N K1 )d 
'[O/H] 



= P(>[0/H])- (13) 

Thus the upper bound for the EMF is simply the cumu- 
lative distribution of line enrichment, i.e. the Kaplan- 
Meier distribution shown in Figure El We have implic- 
itly assumed that the lowest density regions of the IGM 
are not more heavily enriched than the highly overdense 
areas. 

To produce the EMF's lower bound, we suppose that 
all gas in clouds with ./Vjj i < 10 13,6 is chemically pris- 
tine, i.e., that we have already observed all of the heavy 
elements in the universe. Now, Equation 1131 becomes 



fmass(>[0 / H]) ~^P(>[0/H]) 
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13.6 d log jV: 



— d(logiV H] 



i0.5M 



(>[0/H]) 



(14) 

In the last step, we have used the above estimate that 
~ 50% of all baryonic mass is in clouds with iVn i > 
10 136 . The upper and lower bounds on the EMF given 
by Eauationsll3landll4lwere used to produce the shaded 
regions in Figure El 

It is more difficult to estimate the volume filling factor 
of enriched gas without reference to simulations, due to 
the complex nature of the Ly-a forest topology, and the 
lack of a simple analytic form for the volume fraction 
dV/dNn 1 . However, simulations can easily calculate the 
volume fraction within a given H I density contour, or 
which contains a cert a in fra ction of Of, - Figure 20b in 
iMiralda-Escude et alJ l)1996fl may be used to estimate 
the filling factor of the contour containing half of the 
baryons at z = 3. This number turns out to be small, 
only ~ 5% of the total volume of the universe. One 
can guess why this is the case from Figure IT7I we probe 
to an overdensity of p/p ~ 1.6, and most of the mass 
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in the universe is at densities slightly above the mean 
(the filaments), while most of the volume is at densities 
slightly below the mean (the voids). We again emphasize, 
at the lowest densities where it is possible to probe the 
intergalactic metallicity, we only trace cosmic filaments 
- we have not reached the voids. 

3.10. Contribution of Intergalactic Oxygen & Carbon to 
Closure Density 

We have estimated the quantity f^c iv using two differ- 
ent methods, initially to compare with previous studies 
and then to test the consistency of our ionization model. 
The first calculation takes a direct sum of t he observed 
C iv column densities, as in ISongailal 112001 : 



iv — 



1 

—mc, 

Pc 



(15) 



Here p c = 1.89 x 10 29 gm cm -3 is the current closure 
density, nig iv is the mass of the C iv ion, and the up- 
per sum includes all observed C IV lines. The lower sum 
represents the total absorption pathlength of the sam- 
ple (defined in the previous section; note that our for- 
mula for AX differs from Songaila's), which summed 
over our 2 C IV sightlines comes to AX — 3.27. Ap- 
plying this to the data we find f^c iv = 3.50 x I0~ s h~ 1 
(where Ho = lOO/i km s" 1 Mpc -1 ). This is in excellent 
agreement wit h other estimates of tig iv at z ~ 2.5 f rom 
the literature l|Songailal200HlBoksenberg et alJ2003l) . In 
fact, this value for tig rv seems to be observed at all red- 
shifts out to z ~ 5 ijSongaila fc Cowieil996HPettini et all 
2003). At least for z > 4, this may indicate that the 
C IV systems are enriched and /or ionized by local sources 
ijBoksenberg et al.ll200l iPettini et al.ll2003j) . 

With a benchmark in hand, we now recalculate £lg IV) 
this time starting from the observed distribution of H I in 
the forest, and applying our metallicity and ionization 
models to predict the total amount of C iv . A com- 
parison of this result with the straightforward sum from 
Equation El tests the consistency of the enrichment 
model. We begin with the H 1 column density distri- 
bution function /(jvh i,z)i calculated from our H 1 line fits 
as described in the previous section. We may perform 
a weighted integral of /(at h Ij2 ) to obtain an analogue of 
the last expression in Equation El with H 1 substituted 
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for C iv : 

E^Hi 



C 
#0 



JV H ,/( 



H i- 



(16) 



Folding in metallicity and ionization corrections, we de- 
rive the C IV mass density, again through analogy with 
Eauationll5l 



n 



C iv 



-m C i 



C 



fa: 



x / N Hl f (x , Nlll) ^dN Hl (17) 
J Jh i 

The ionization correction factors, taken from our 
CLOUDY calculations (and shown in Figure^}, are den- 
sity dependent and therefore included in the integral. 
We assume the metallicity distribution is independent 
of iVm, after Figure El To calculate the mean metal- 
licity, we assume the [C/H] and [O/H] distributions are 
identical, and use the approximation for the [O/H] dis- 
tribution described Section 3.6 - a gaussian with mean 
([§]) = -2.85 and a = 0.75. The distribution of 
IqIC/ h } i s therefore lognormal, with mean 



w io/m\ = 



exp 
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Using these values in Equation El we find iv = 
1.78 x 1Q~ 8 /i — 1 , similar to the value found from Equa- 
tion El but about a factor of 2 lower. This discrepancy 
is less significant than it may first appear, since the sum 
in Eauation ll5l is always dominated by the few strongest 
systems in the sample. In our case, three strong outliers 
from a sample of 83 lines contain over half of the total 
C iv column density. These are clearly seen in Figure 
12 they are found within very strong C iv systems that 
probably do not represent tenuous intergalactic matter. 
A re-evaluation of Eg uat ion 1151 without these three lines 
included yields Sic iv = 1-31 x 10 -8 , bringing the esti- 
mates from both methods into close agreement. This 
also suggests that a major portion of the C iv mass in 
the universe may be contained in a small number of sys- 
tems that are highly enriched or harbor unusual ioniza- 
tion conditions. 

When estimating fio vi, it is considerably more difficult 
to perform the calculation in Eauation ll5l because of Ly- 
a forest blending and false positive identifications. We 
have accounted for these effects in our determination of 
the metallicity distribution, so it is much more strightfor- 
ward to apply Eauation ll7l yielding Slo vi = 7.89 x 10~ 8 . 
By omitting the O vi or C iv ionization corrections from 
Equation El (but retaining the H I correction) we may 
estimate the total contribution of carbon or oxygen in 
all ionization states to the closure density. The result- 
ing values are listed in Table 13.101 along wi th the val- 
ues fo r the C iv and O VI ionization states. iQian et all 
( 2002) have constructed an observationally motivated en- 
richment model for the galactic halo, requiring several 
generations of supernovae from very massive stars. Their 
prediction for SIq iv from this model is an order of mag- 
nitude lower than the estimate presented here, and their 
fio vi is 1.65 times larger than our measurement. We 




Fig. 19. — Oxygen abundances for individual Ly-« forest lines, 
plotted against their gas overdensity derived from iVji i- Data from 
the dd and bd sample are shown as heavy and light solid points, 
though error bars have been suppressed for clarity. 3cr upper limits 
are shown with arrows. The continuous curves depict predictions 
for several numerical calculations of the intergalactic enrichment, 
based on phenomenological models of star formation and metal 
eject ion. The open squares and t riangles represent z = 2.3 models 
from Springcl & Hcrnauist (2003) with and without winds, respec- 
tively. The dashed line is taken from the 2 = 3 wind model of 
lAguirre et"aT] 120011) . The shaded region encompasses a possible 
range of enrichment from Hi cooled Population III stars, while the 
horizontal dotted line repr esents an absolute u pper limit on the 
metallicity of these objects iBromm ct al. 2001], also see text). 



caution the reader that estimates of [C/O] from our mea- 
surements will not be meaningful, as we have constrained 
our models of the ionizing background using assumptions 
about the [C/O] ratio. 

Equation 1171 is of further interest because it can be 
used to determine what fraction of the total C iv (or 
O vi mass) in the universe has been detected in the 
survey. This is achieved by changing the limits of the 
H I integration to match our column density cutoffs - 
iV H i > 10 13 - 6 for O vi and N K , > 10 14 for C iv . Ac- 
cording to this method, our limits probe 98.8% of the 
C iv mass and 90% of the O vi mass. 

4. DISCUSSION 

We have shown that cosmological filaments are en- 
riched with oxygen and carbon to near the mean gas 
density in the universe, and that the median [C, O/H] = 
—2.82 is similar to previous estimates, though about a 
factor of 2 lower. We have also shown several systems 
with very low abundances, implying that ~ 30% of Ly- 
a forest lines have metallicities of [C,OfH] < — 3.5. In 
this section, we examine our results in the context of cos- 
mic chemical evolution models, to assess their relevance 
for distinguishing between different enrichment mecha- 
nisms. 

In Figure El we show a variant of the metallicity- 
density relation from the O VI sample, now recast in 
terms typically used in numerical simulations. Points 
from the dd sample are shown with large symbols; the bd 
systems are shown with small dots, ~ 60% of which are 
false positives. The error bars on detected points have 
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been omitted for clarity. Along with the data points, we 
have indicated the expected metallicity-density relations 
for different chemical enrichment scenarios. We consider 
two classes of models: one where metals are expelled at 
very early times by the first generation of stars, and one 
where the ejection epoch occurs much later and is associ- 
ated with bursts of "normal" star formation and galactic 
winds. 

Recent studies of star formation in zero metallic- 
ity environments show that the first stars in the uni- 
verse probably formed via Hi cooling (lAbel et al]l2002t 
iBromm et aITl200l lOstriker fc Gnedinl l!99(fT These 
"Population III" stars were uniformly massive (M ~ 
150 — 250M Q ) and short lived, culminating in pair- 
instability supernovae that dis rupted the entire mass o f 
the star, leaving no remnant (jlHeger fc. Woosley 2002). 
Yield calculations by these authors indicate that fully 
half of the total stellar mass can be converted into met- 
als and expelled. All of this takes place prior to z ~ 30, 
at which point the stars are in comparatively small haloes 
that are close together. Thus the Population III objects 
provide a potentially efficient mechanism to enrich a large 
fraction of the cosmic volume. However, this mode of star 
formation is also self-regulating, which leads to a natu- 
ral limit to the quantity of metals that can be produced. 
When the first Population III objects form, they shine 
brightly in the near ultraviolet and emit enough pho- 
tons in the Lyman Werner bands to photodissociate all 
of the nearby H2, turning off the cooling source required 
to produce more stars. Thus the first major epoch of 
star formation and metal enrichme nt is thought to end 
at z ~ 25 - 30 i|Mackev et alJl200^1 . 

The resulting contribution to the intergalactic metal 
abundance depends on fairly uncertain models of the 
star formation rate prior to z ~ 30 and the effeciency 
with which the supernova ejecta can escape protogalac- 
tic haloes and mix with distributed gas. In the Figure we 
have shaded a region that reflects current estimates of the 
Population III metal p roduction, for mixin g efficiencies 
between 5% and 50% l|Mackev et all 12003ft . The hori- 
zontal dashed line represents a mor e strict upper l i mit to 
the Population III enrichment level. IBromm et alJ l)2001f) 
have shown that massive, Hi cooled stars will not form 
from gas with metallicity above this value, as atomic line 
cooling becomes too efficient to maintain large scale sta- 
bility, and a collapsing cloud fragments into smaller mass 
units. 

It is clear that the metals we have detected at p/p > 1 
are in excess of the Population III production levels, in 
some cases by several orders of magnitude. Some of our 
most sensitive upper limits are within a factor of 2 of 
the model predictions (recall that these are 3er limits), 
indicating that parts of the IGM could still possess Pop- 
ulation III enriched gas. It is also intriguing that an 
extrapolation of the Kaplan-Meier cumulative metallic- 
ity distribution (Figure 1121) might be consistent with a 
universal floor just below [C, O/H] = —4.0, about where 
one might expect to pick up Population III metals. This 
speculation is neither ruled out, nor required by the data. 

We now turn to chemical enrichment models based on 
more recent star formation and galactic winds. This 
mode of metal production is more familiar, as starburst 
galaxies and superwinds have been observed extensively 
at low and high (z < 5) redshifts l|Lehnert fc Heckmanl 



fl996l iPettini et aLlEoOll iDawson et alJEOQ^I . The stel- 
lar physics and populations basically resemble those in 
the local neighborhood, except that at z > 2.5 the star 
formation rate in the typical galaxy was much higher, 
and the IMF may have been tilted towards high mass 
(jPettini et al.1 [2002*1 . The physics of superwinds is ex- 
tremely complex, and a detailed understanding of their 
impact on the IGM requires numerical simulations that 
draw heavily on observationally motivated recipes for 
star formation, metal yields, and wind energetics. Once 
these ingredients arc in place, the simulations provide an 
excellent view of how ejecta interact and mix with the 
surrounding IGM. 

We have shown the predicted metallicity-density rela- 
tions from 3 different simulations with continuous curves 
in Figure 1191 The solid line c onnecting open trian- 
gles re presents a simulation from iSpringel &: Hernauistl 
( 2003) that includes local metal enrichment but no galac- 
tic winds. The solid line with open squares represents 
a separate run by the same authors, this time with a 
fiducial wind model tuned to match observations at low 
rcdshift. Both curves are for z = 2.3. The dashed cu rve 
shows the "W128" simulation of lAguirre et all l|2001f) at 
z = 3, which was calculated by applying a wind model 
to already completed numerical simulations. Again, the 
star formation and feedback parameters were adjusted to 
match local observations. 

We confirm these authors' finding that models with 
galactic winds are able to enrich the universe to the lev- 
els required by observations. Moreover, the model with- 
out winds grossly underestimates the cosmic metallicity, 
implying that essentially all of the heavy elements yet 
observed in the IGM were expelled from galaxies at rel- 
atively recent epochs, loosely 3 < z < 10. This inter- 
pretation is consistent with measurements of turbulence 
in C iv systems at similar redshift, which indicate that 
metal-rich gas is of ten stirred on 10 — 100 Myr timescales 
l|Rauch et al.ll200lj) . 

Both wind models predict a mild correlation of increas- 
ing Z '/Zq with density, and there is a weak indication 
that this trend follows the data. However, we again cau- 
tion that our metallicity measurements are less accurate 
at p/p > 100 (see Sections 3.1,3.2), and the correlation 
is very weak at lower densities. No such trend is seen in 
the C iv data. Many of the absorption lines in the range 
p/p — 100 — 300 may in fact represent snapshots of the 
evolving winds themselves, in which case our photoion- 
ization models will probably overestimate [O/H]. These 
absorbers are complex and multiphased, but their abun- 
dances have been c rudely estimated at [O/H] > —1.5 
l)Simcoe et al.1120021) . roughly consistent with the mod- 
els. 

Many of our points and upper limits fall well below 
the wind model predictions. This is most clearly seen 
at p/p ^8 — 50 and may extend to lower densities; the 
lack of measurements at p/p ~ 1 — 2 and Z/Z Q < 10~ 3 
is a selection effect, caused by the signal-to-noise depen- 
dent detection thresholds of the data. Unfortunately, our 
data are not sensitive enough to probe p/p < 1, where 
both wind models predict sharp abundance falloffs. With 
the large parameter space spanned by the wind and no- 
wind models, it seems likely that any plausible abun- 
dance pattern could be reproduced given a proper dis- 
tribution of wind ejection velocities and metal content. 
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The model curves represent a volume average of Z/Zq 
at different densities; it will be interesting to test the 
dispersion in simulated metallicities with density against 
the data, though this is beyond the scope of this paper. 

It appears Population III derived metals do not domi- 
nate the intergalactic distribution until one enters the un- 
derdense IGM. Even then, their spatial variation could 
be strongly affected if Population III star formation is 
strongly biased. It will be difficult to probe these met- 
als using direct O VI measurements, as the fluctuations 
are at the ~ 1% level from the continuum. Larger tele- 
scopes with very stable instrumentation will be required, 
and even then the effects of the Ly-a forest may render 
such measurements impossible. A more promising ap- 
proach for the far future will be to search for intergalactic 
O VII and O Vin absorption, as these are the dominant 
ionization states in the voids. However, the technical 
challenges involved in observing weak O VII and O VIII at 
high redshift will still be major obstacles for many years 
to come. 

4.1. The Ultimate Closed Box? 

We now consider a simple, global model of the chem- 
ical evolution of the universe. This model is analo- 
gous to chemical models of the galactic ISM, but rather 
than focusing on the transport of mass and chemicals 
between stars and the ISM, we study the transport of 
mass and chemicals between galaxies and the IGM, to 
determine the average chemical yield of galaxies in the 
early universe. As in galactic chemical evolution mod- 
els, we assume that metals are recycled instantaneously, 
so for each dM ga \ of mass flowing into galaxies, a cor- 
responding mass ydMgai of heavy elements is immedi- 
ately returned to the IGM. The quantity y is therefore 
a heavy element yield for galaxy formation. The instan- 
taneous approximation should work marginally well for 
the IGM, since the timescale for massive stellar evolution 
(r < 10 7 years for a M > 15M Pl star) and galactic winds 
(r ~ 10 7 - 10 8 years: IShaplev et al.ll2001(l is shorter than 
cosmic timescales (r ~ 10 9 years) at z ~ 2.5. 

We define (/> ga i as the rate of mass deposition into galax- 
ies from the IGM; this mass may either be converted into 
stars or reside in the ISM of the galaxy. In each galaxy 
that is formed, a fraction / ent of the ISM may be en- 
trained in outflows and removed from the galaxy before 
it is substantially enriched. The exchange of mass due 
to galaxy formation is then governed by the equations: 
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-{ZM lGM )=y(l - ie„t)0gal - Z(l - / cnt )0 g al (20) 

where Migm is the total mass of gas in the IGM, and 
as usual Z is the fraction of Migm bound up in heavy 
elements. By differentiating Equation I2UI and dividing 
by Equation^] we obtain 
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We have assumed that the total mass M tot = Migm + 
M ga i remains constant, i.e., there are no significant 
sources or sinks. We further assume the initial condi- 
tions Z(t ) = 0,M ga i(i o ) = 0,M gas (t o ) = M tot . This 



is equivalent to the "simple closed box" model of galac- 
tic chemical evolution. Clearly, the IGM plus galaxies 
represents the ultimate closed system. 
The solution to Equation is 

Z 1GM=y \n(4^-)- (22) 



V Migm / 

We may use this relation to constrain the metal yield 
from galaxies in the era before z ~ 2.5. We have 
calculated Z from the metallicity measurements pro- 
vided above, noting that Ziqm = (lO[°/ H l) Z@, with 
(l0[°/ ff l) ~ 1CT 2 - 2 and Z Q = 0.02, hence Zi GM = 
1.3 x 10~ 4 . The quantities M tot and Migm m &y be esti- 
mated using recent observations, first noting that 

Mot w Jh_ (23) 
Migm Lya 
where f^Lya is the contribution to closure density from 
diffuse gas in the Ly-a forest. The value il^h 2 = 0.0224 
is no w fairly secure from WMAP observations of the 
CMB llSpergel et al]l2003t) and Big Ba ng Nucleosynthe- 
sis observations ijO'Meara et all 120011) . Our estimation 
of r^Lya is somewhat less certain. A lower bound on 
this quantity has been estimated using observations of 
the average flux decrement in the Ly-a for est, either by 
match ing the observations to simulations (Rauc het alJ 
ll997cT) or by requiring that the redshift and real-space 
extent of Ly-a forest clouds are of similar magnitude 
((Weinberg et al.lll997fl . Rescaling Weinberg's conserva- 
tive estimate to a cosmology with (Om,Oa) = (0.3,0.7) 
gives n hya h 2 > 0.020 while Rauch finds fl hya h 2 > 0.021. 
It seems that there is basic agreement on a lower limit 
for the mass of the forest: OL yQ /Ob ^ 0.92. 

We can estimate an upper limit for S^Lya/^b by noting 
that Ob w fiLya + O, + Odla + !iwH + ^hot, accounting 
for most known sources of baryons in the universe. The 
various terms represent the mass of the Ly-a forest, stars, 
damped Ly-a systems, warm-hot (T = 10 5 — 10 7 K) gas, 
and hot gas in galaxy clusters, respectively. To produce a 
conservative upper limit for OLy Q , we subtract estimates 
of the mass density in damped systems and stars from 
Ob. Of all the constituents of Ob, these two are the ones 
most likely to represent local star forming regions at high 
redshift. 

The value of Odla has been estimated using extensive 
quasar surveys both at hig h and low resolution. We use 
the measurements in iStorrie-Lombardi fc Wolfel (2000) 
for 2 < z < 3, O DL a = 0.0009 ± 0.0003/ifo 1 . The stel- 
lar mass density O* is much more difficult to measure 
since it is characterized in em ission rather than absorp- 
tion. [Dickinso n et al.l l)2003t) have used IR photometry 
to integrate directly the total O, contained in luminous 
Lyman break galaxies from the HDF. Their photometry 
corresponds approximately to rest-frame optical wave- 
lengths, which should be less subject to dust extinction 
and/or bursting star formation than rest-frame UV mea- 
surements. They estimate a conservative lower bound of 
O* > 0.0002 at z ~ 2.5, with their best estimate of the 
true value being a factor of 5 larger. Combining the most 
conservative estimate of O, with the measured Odla, we 
find ilhya ;$ Ob — Odla — 0» = 0.0446, leading to the 
final constraint 



0-92 < %^ < 0.97, 



(24) 
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assuming h = 0.71. Entering these values into Equation 
1221 the corresponding range for the galactic yield is 

0.0015 < y < 0.0041. (25) 

In other words, typical galaxies in the early universe re- 
cycled 0.1 — 0.4% of their input mass back into the IGM 
as heavy elements. 

It is also interesting to compare the galaxy yield y with 
the stellar yield interior to the galaxy, ?/*. The stellar 
yield specifies the amount of metals released into the 
galaxy's ISM, y»e?M» for each gLM* of stars formed. If 
we assume that a fraction / e j of the metals formed in a 
given starburst are ejected from the galaxy into the IGM, 
we may crudely relate the stellar and galactic yields for 
a galaxy: 

y(M, + A/ism) = y*MJ cj . (26) 

Here, M» and Mism represent the amount of mass within 
the galaxy locked up in stars and gas, respectively. These 
terms may be rearranged to produce the following expres- 
sion: 

^ = i( 1 + ^)- <"> 

For bare stars, the ejected metal fraction is simply the 
ratio of the galactic and stellar yields; conversely, for 
galaxies with inefficient star formation more of the syn- 
thesized metals must escape in order to boost up the 
galactic yield. We saw in the previous section that 
most of the intergalactic enrichment is caused by rel- 
atively recent winds from star forming galaxies. Since 
high redshift galaxies show evidence for preferential en- 
richment by Type II supernovae, we estimate y* by 
integr ating the supernova yields of iWooslev fc Weaverl 
(1995) over an initial mass function, from 0.1M Q to 
1OOM . We have u s ed va ri ous prescriptions f or the IMF, 
including iSalpeterl (Il955f) . iKennicutt et alJ (|1994|) . and 
Mill er fc Scalol <|1 979(1 . which produce weighted yields 
ranging from y» = 0.001 (Miller-Scalo) to y„ = 0.011 
(Kennucutt). The yield in the solar neighborhood is 
approximately y» ~ 0.02. We may set a conservative 
lower bound on / e j, using y = 0.0015 ( Equation 12*5)) and 
y* = 0.011, with the result 

U > 14% (l + ^ff ) . (28) 

The actual value of / e j is probably higher than 14%, since 
at earlier times less of the gas has been converted into 
stars, leading to a larger M ^f M ■ This could be mitigated 
to some extent if the high redshift IMF is extremely top- 
heavy, which drives up y». 

Despite our crude models and large uncertainties, the 
overall impression is that galaxies at high redshift are 
quite efficient at returning the metals they produce to 
the IGM. On average, they will eject over ^ of their nu- 
cleosynthetic byproducts. Such vigorous recycling sug- 
gests that much of the star formation in early galaxies 
occurred in bursts, rather than a more quiescent, gradual 
mode that would allow them to retain their metals. 

Observations of local starbursts indicate that the yields 
and ejection fractions we derive are fairly reasonable for 
star forming dwarf galaxies. For example, the nearby 
dwarf NGC 1569 is driving a galactic wi nd that con- 
tains ~ 34, 000M o of oxygen at Z ~ Z Q l|Martin et all 



2002) . The parent galaxy has a combined Mism + Af* = 
1 — 1.5 x 10 8 Mq. If we assume a solar mass fraction of 
oxygen relative to total heavy element mass (0.378), the 
implied yield for this galaxy is y = 0.0007 - roughly a fac- 
tor of 2 below our lower limit for the high redshift yield. 
The authors also find that the wind carries away nearly 
all of the metals produced by the starburst, while the 
neutral gas disk holds ~ 5 times more oxygen from prior 
periods of quiescent star formation, implying a metal 
ejection fraction of / C j ~ 15 — 20% over the lifetime of the 
galaxy. We do not wish to overinterpret an isolated ex- 
ample when discussing the global properties of all galax- 
ies in the early universe. We simply note that similar 
objects at high redshift could produce and distribute the 
quantity of heavy elements seen in the IGM, provided 
their wind velocities are high enough to efficiently mix 
debris over large scales. 

5. SUMMARY AND CONCLUSIONS 

We have presented new observations of O vi and 
C iv absorption at z ~ 2.5, which were used to estimate 
the metallicity distribution function of the intergalactic 
medium. Seven quasar sightlines were used to measure 
O vi for lines with Nn i > 10 13 6 ; two of these with ex- 
ceptionally high signal-to-noise ratios were also used to 
measure C IV for lines with Nn i > 10 14 . These lim- 
its probe densities of p/p ~ 1.6 relative to the cosmic 
mean. For each line in the samples, we estimated [O/H] 
and [C /H] by applying density-dependent ionization cor- 
rections to the measurements of Nn I} No vi, and Nc rv- 
We experimented with several prescriptions for the ioniz- 
ing radiation background, eventually settling on a model 
that produced the best match between the distributions 
of [O/H] and [C/H]. This model is dominated by quasar 
light with an original spectrum of F v oc v~ XA , that has 
been reprocessed through the IGM, and normalized to a 
flux of log J = —21.5 at the H I Lyman limit. Motivated 
by observations of metal-poor galactic halo stars, we also 
experimented with the radiation field to produce a rel- 
ative intergalactic abundance of [C/O]=-0.5. A softer 
UV spectrum including galactic radiation was required 
to meet this criterion. 

Our sample contains a mixture of detections and up- 
per limits on the metallicity for each absorption line 
in the forest. The individual O vi and C iv lines are 
observed to scatter both above and below the trend of 
[C,0/H] = —2.5 observed in previous studies of the in- 
tergalactic metallicity, and several upper limits lie nearly 
an order of magnitude lower. Since the sample con- 
tains upper limits, we have used survival statistics to 
construct a Kaplan-Meier product limit estimate of the 
[O/H] and [C/H] distributions within the Ly-a forest. 
For the O vi distribution, we carefully corrected for the 
effects of false positive identifications that might result 
from interloping H I lines in the Ly-a forest. Our basic 
results may be summarized as follows: 

1 . The mixture of heavy elements within cosmological 
filaments does not differ qualitatively in regions of 
high and low density (for roughly 1 < p/p < 10 — 
15). We have not observed any significant variation 
in the [O/H] or [C/H] distribution that correlates 
strongly with Nn 1 , though there may be a weak 
correlation when we use our softest model for the 
ionizing background spectrum. 
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2. At/9/p>10 — 15, we see evidence of a change in the 
trend of [0 /H] relative to [C /H] , which we interpret 
as the onset of multi-phased ionization structure in 
many of the the stronger absorbers. 

3. Roughly 30% of lines in the Ly-a forest are enriched 
to abundances below [C,0/H] < —3.5. Thus we 
have not detected evidence for a metallicity floor 
in the IGM. If such a floor exists, extrapolation of 
the cumulative abundance distribution suggests it 
would lie in the range —5 < [C, O/H] < —4, about 
a factor of 3 lower than the limits probed by our 
survey. Nevertheless, some portion of the cosmic 
volume is very metal poor, even in regions with 
p/p ~ 10. 

4. The median abundance of the filaments is 
[C,0/H] = -2.82. By differentiating the cumu- 
lative abundance distribution, we obtain the prob- 
ability density for [O/H] and [C/H], though we 
cannot construct the lowest metallicity portion of 
the distribution with our data. For the ~ 70% 
of lines which we do measure, the distribution 
may be approximated as a gaussian with mean 
([§]) = -2.85 and a = 0.75 dex. The distri- 
bution of Z/Zq is therefore lognormal, with mean 

(id = 1(r2 ' 2 - 

It is also instructive to express our results in terms 
of an enriched mass function, defined as the cumulative 
baryonic mass fraction of the universe that is enriched 
above a given abundance level. We place upper and 
lower bounds on the EMF by assuming either that the 
abundance pattern observed at p/p > 1.6 extends to ar- 
bitrarily low density, or that we have detected all heavy 
elements and that lower density clouds are chemically 
pristine. If the true pattern lies between these two ex- 
tremes, then ~ 50 — 60% of all baryons in the universe 
have been enriched to [C,0/H] > —3.5 by z ~ 2.5. We 
have not calculated similar constraints for the enriched 
volume fraction, since even at these low column densities 
we only probe ~ 5% of the spatial extent of the universe. 
Generally, then, our conclusions apply to the gas found 
in cosmic filaments at z ~ 2.5, where most of the mass 
resides. We have not yet reached the voids which contain 
most of the cosmic volume. 

We have examined our measurements in light of two 
established models for cosmic chemical enrichment. The 
first model describes metal production from the Popula- 
tion III stars postulated to form at z > 30 from H 2 cool- 
ing, and the second describes metal-laden superwinds 
that are driven from starburst galaxies at more recent 
epochs. A comparison of these models with our data 
suggests that Population III stars do not produce enough 
metals to enrich the cosmic filaments at observed levels. 

Models incorporating galaxy outflows similar to those 
seen at low redshift can reproduce the observed average 
intergalactic metallicity. The significant scatter observed 
towards low Z/Zq indicates that there may be local vari- 
ations in the frequency or efficiency of these winds. Yet 
it seems likely that the models will be able to match 
these trends given proper tuning. The wind models pre- 



dict a substantial decrease in the average metallicity at 
H I densities slightly below the threshold of our survey. 
The measurements presented herein do not have suffi- 
cient sensitivity to detect this decline; pixel-statistical 
methods may improve on this in the near future, and 
long term prospects may include searches for intergalac- 
tic O vii and O vni , which are the dominant ionization 
species at these densities. 

To find regions of the universe whose metals are de- 
rived from Population III stars, it will be necessary to 
survey the cosmic voids. However, if Population III 
star formation is significantly biased then it is possible 
that the underdense IGM could remain nearly chemically 
pristine. The chemical enrichment of cosmological fila- 
ments is dominated by the debris of galactic outflows, 
and therefore must have taken place relatively recently. 
Taken together with our estimates of the enriched mass 
function, this indicates that roughly half of all baryons 
have either been processed through a galaxy or mixed 
with material that has done so in the first 3 Gyr after 
the Big Bang. 

Finally, we present a simple closed-box model of the 
chemical evolution of the universe. We use this model 
along with our metallicity measurements to estimate the 
average metal yield of galaxies prior to z = 2.5. For a 
galaxy, the metal yield is determined in part by the star 
formation rate and raw stellar yields, but also by the 
efficiency with which outflows can expel metals from the 
galaxy's gravitational potential. We find that the typical 
galaxy ejects ~ 0.1 — 0.4% of its formed mass into the 
IGM as heavy elements. This amounts to at least > 14% 
of the total quantity of metals synthesized within these 
galaxies. These yields are not qualitatively different from 
what is seen in some local dwarf starburst galaxies. 
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APPENDIX 

A NOTE ON MODELS FOR THE UV/X-RAY IONIZING BACKGROUND 

All of the models for the metagalactic ionizing background spectrum were provi ded to us by F. Haardt, using 
his CUBA package. This software performs calculations similar to those described in lHaardt fc MadaiJ (^996), with 
some new features. The basic principle is to integrate the best available redshift-dependent luminosity functions 
from the literature to create an average source spectrum of galaxies and quasars at each redshift. Then, the photons 
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are statistically propagated through the a parametrized model of the IGM to si mulate absorption and re emission 
at important ionization edges. The newer models extend the range of the original lHaardt fc Madai] l|199fifl spectra 
by including the X-Ray background, which turns out to be quite important for O vi in the low-density IGM. We 
have already described in Section 3.3 some of the uncertainties in the UV portion of the spectrum (e.g. the range in 
measurements of the spectral slope a) and their effects on our results. 

The model for the X-ray background is calculated independently from that of the UV/optical light in the new 
models. It is assumed to originate only from AGN, and its source function is calculated by integrating the X-ray 
luminosity function at each redshift. This function is ass umed to evolve independently of the optical/UV LF, and its 
redshift-dependent form is taken from lBovle et al.l l|1993|) . Seyfert I galaxies contribute flux at all X-ray wavelengths, 
since the IGM is exposed to the bare X-ray flux from the central AGN engine. The core of Seyfert II galaxies is covered 
by an obscuring torus which absorbs soft X-ray photons but transmits hard X-rays (E > 1 keV) . The hardness of the 
X-ray background is thus sensitive to the relative numbers of Seyfert I and II galaxies at each redshift. 

The UV/ X-ray backgro u nd m odels use weighted combinations of Seyfert I and II X-ray spectra, constructed as 
described in iMadau et alJ l)1994|) . The evolution in number ratio of Seyfert I / Seyfert II galaxies is estimated by 
assuming that AGN with B-band luminosity above a given threshold are always Seyfert I, while those below are 
Seyfert II. This is normalized to locally observed Type I/T ype II fractions. The mo dels match observations of the UV 
and hard X-ray backgrounds at low redshift, within errors flHaardt &: M adau 200j]). The local soft X-ray metagalctic 
background is less well constrained, partly because of significant galactic foregrounds. This range turns out to be most 
important for our O VI results. 

At z ~ 2.5, the UV/optical luminosity function is near its peak value over the history of the universe. Since the 
models calculate the Type I/Type II AGN fraction based on optical flux, this means that nearly all the AGN at this 
redshift (> 95%) are counted as Seyfert I, and the soft X-ray background is quite high. The models are still uncertain, 
but it seems more likely that the soft X-ray backgrounds used here would err on the high side rather than the low, 
since essentially no dilution by Type II AGN is included at these redshifts. This affects our results in the sense that a 
higher soft X-ray flux will suppress O vi production, as described in Appendix B. 

We have performed the entire analysis for several models of the ionizing background, retaining the HM1.80 model 
with log Jgi2 = —21.5 since this produces the best match between the observed [O/H] and [C/H] distributions. The 
other models we tested included HM1.50 and HM1.55 slopes, with normalizations of log J912 = —21.1, —21.4, —21.5, 
and —21.6, and an HM1.80 model with H 1 ionizing flux from galaxies included. Our optimal normalization is slightly 
lower than estimates of J912 made using the proximity effect at similar redshifts, though the discrepancy is only at the 
~ la level. The field of X-ray population modeling at high redshift is rapidly evolving with the comprehensive surveys 
being undertaken by XMM and Chandra, and much better models of the high redshift X-ray background should be 
forthcoming in the near future. 

EFFECTS OF THE X-RAY BACKGROUND ON DERIVED OXYGEN, CARBON ABUNDANCES 

It is seldom appreciated that for densities below Ah i ~ 3 x 10 14 , the oxygen in the IGM is actually cwerionized 
relative to O vi - most atoms are in the O VII and O vin states (See figure 4). In photoionization equilibrium, the 
density of O vi is governed according to the equation 



and likewise for other ionization states. Since at low densities No v *C iVo V n, much more O vi is produced by 
recombination from O vn than photoionization of O v . Accordingly, the abundance of O vi is determined by the 
number of O vn ions available for recombination. At low densities, O vn and O vin are found in similar proportions 
and their ratio is sensitive to the background flux at the O vn — >0 vin ionization edge. This ionization energy is 
in the soft X-Ray band (739eV, or 17A, indicated in Figure 0), which leads to the counterintuitive result that the 
intergalactic O vi abundance is critically sensitive to the intensity of the soft X-Ray background at low densities. A 
decrease in the soft X-Ray background favors a higher O vn abundance, and hence actually enhances O vi . 

At densities above iVm ~ 3 x 10 14 , most oxygen is in cither the O vi or O VII state. The ionic balance is then 
governed by the intensity of the UV background at the O v — >0 vi and O vi — >0 vn ionization edges (109. 5A and 
89.5 A, indicated in Figure EJ. In this regime, a decrease in the background results in fewer photons with sufficient 
energy to create O vi , thus lowering the predicted O vi column densities. Thus a substantial decrease in the X-ray 
background would cause our metallicities for the lowest density lines to move to lower values, but leave the higher 
density lines basically unchanged - introducing a dropoff in metallicity below Ah i ~ 10 14 . Our best current models 
do not reveal such a trend, but the reader should bear in mind that a significant revision of the X-ray background at 
z ~ 2.5 could affect this result. 
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